WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 
C07H 21/04 



Al 



(11) International Publication Number: WO 99/02547 

(43) International Publication Date: 21 January 1999 (21.01.99) 



(21) International Application Number: PCT/US98/14101 

(22) International Filing Date: 7 July 1998 (07.07.98) 



(30) Priority Data: 

08/888,429 



7 July 1997 (07.07.97) 



US 



(71) Applicant: UNIVERSITY OF MASSACHUSETTS [US/US]; 

Suite 800, 18 Tremont Street, Boston, MA 02108 (US). 

(72) Inventors: DAVIS, Roger, J.; 53 Hickory Drive, Prince- 

ton, MA 01541 (US). WHITMARSH, Alan; Apartment 
K, 50 Shrewsbury Green, Shrewsbury, MA 01545 (US). 
TOURNIER, Cathy; 10 Liscomb Street, Worcester, MA 
01604 (US). 

(74) Agent: FASSE, J., Peter; Fish & Richardson P.C., 225 Franklin 
Street, Boston, MA 02110-2804 (US). 



(81) Designated States: AU, CA, JP, KR, European patent (AT, 
BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, 
MC, NL, PT, SE). 



Published 

With international search report. 



(54) Title: CYTOKINE-, STRESS-, AND ONCOPROTEIN-ACTIVATED HUMAN PROTEIN KINASE KINASES 



(57) Abstract 

Disclosed are human mitogen-activated (MAP) 
kinase kinase isoforms (MKKs), MKKs mediate unique 
signal transduction pathways that activate human MAP 
kinases p38 and JNK, which result in activation of 
other factors, including activating transcription factor-2 
(ATF2) and c-Jun. The pathways are activated by a 
number of factors, including cytokines and environmen- 
tal stress. Methods are provided for identifying reagents 
that modulate MKK function or activity and for the use 
of such reagents in the treatment of MKK-mediated dis- 
orders. 
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CYTOKINE - , STRESS - , AND ONCOPROTEIN- ACTIVATED HUMAN 

PROTEIN KINASE KINASES 

5 Background of the Invention 

This invention relates to protein kinases. 
Mitogen-activated protein (MAP) kinases are 
important mediators of signal transduction from the cell 
surface to the nucleus. Multiple MAP kinases have been 
10 described in yeast including SMK1 , HOG1 , MPK1 , FUS3 , and 
KSSi. In mammals, the MAP kinases identified are 
extracellular signal -regulated MAP kinase (ERK) , c-Jun 
amino -terminal kinase (JNK) , and p38 kinase (Davis (1994) 
Trends Biochem. Sci . 19:470). These MAP kinase isoforms 
15 are activated by dual phosphorylation on threonine and 
tyrosine . 

Activating Transcription Factor-2 (ATF2 ) , ATFa , 
and cAMP Response Element Binding Protein (CRE-BPa) are 
related transcription factors that bind to similar 

20 sequences located in the promoters of many genes (Ziff 
(1990) Trends in Genet. 6:69) . The binding of these 
transcription factors leads to increased transcriptional 
activity. ATF2 binds to several viral proteins, 
including the oncoprotein Ela (Liu and Green (1994) 

25 Nature 368:520), the hepatitis B virus X protein (Maguire 
et al. (1991) Science 252:842), and the human T cell 
leukemia virus 1 tax protein (Wagner and Green (1993) 
Science 262:395). ATF2 also interacts with the tumor 
suppressor gene product Rb (Kim et al . (1992) Nature 

3 0 358:331), the high mobility group protein HMG ( I ) Y (Du et 
al. (1993) Cell 74:887), and the transcription factors 
nuclear NF-kB (Du et al . (1993) Cell 74:887) and c-Jun 
(Benbrook and Jones (1990) Oncogene 5:295). 
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Summary of the Invention 
The invention is based cn the identification and 
isolation of a new group of human mitogen-activated 
protein kinase kinases (MKKs) . The MKK isoforms 
5 described herein, MKK3 , MKK6 , MKK4 (including MKK4 - a , - (3 , 
and -y) , MKK7 (including murine MKK7 , human MKK7 , MKK7b, 
MKK7 c , MKK7d, and MKK7e) have serine, threonine, and 
tyrosine kinase activity. MKK3 , MKK4 , and MKK6 
specifically phosphorylate the human MAP kinase p3 8 ac 
10 Thr 130 and Tyr 182 . The MKK4 isofcrms also phosphorylate the 
human MAP kinases JNK (including JNK1 , JNK2 , and JNK5) at 
Thr 133 and Tyr 135 . The MKK7 isofcrms phosphorylate JNK at 
Thr 133 and Tyr 185 . 

Accordingly, the invention features a 
15 substantially pure human MKK polypeptide having serine, 
threonine, and tyrosine kinase activity that specifically 
phosphoryiates human p3 8 MAP kinase. MKK3 has the amino 
acid sequence of SEQ ID NO : 2 . The invention further 
includes MKK6 having the amino acid sequence of SEQ ID 

2 0 NO : 4 and having serine, threonine, and tyrosine kinase 

activity that specifically phosphoryiates human p3 8 MAP 
kinase . 

The invention further features a substantially 
pure human MKK polypeptide having serine, threonine, and 
25 tyrosine kinase activity that specifically phosphoryiates 
human p3 8 MAP kinase and JNK. MKK4 isoform MKK4 - a has 
the amino acid sequence of SEQ ID NO: 6. MKK4 isoform 
MKK4 - (3 has the amino acid sequence of SEQ ID NO:8. MKK4 
isoform MKK4 - y has the amino acid sequence of SEQ ID 

3 0 NO: 10. 

The invention also features a substantially pure 
MKK polypeptide (MKK7) having serine, threonine, and 
tyrosine kinase activity that specifically phosphoryiates 
mitogen-activated protein kinase JNK. MKK isoforms MKK7 
3 5 (murine) and MKK7 (human) have the amino acid sequences 
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of SEQ ID N0S:1S and 26, respectively. The MKK7 isofcrms 
MKK7b, MKK7c, MKK7d , and MKK7e have the amino acid 
sequences of SEQ ID NO: 20, SEQ ID NO: 28, SEQ ID NO: 30, 
and SEQ ID NO: 32, respectively. 
5 As used herein, the term "mitogen-activacing 

protein kinase kinase" or "MKK" means a protein kinase 
which possesses the characteristic activity of 
phosphorylating and activating a human mitogen-activacing 
protein kinase. Examples of MKKs include MKK3 and 

0 MKK6, which specifically phosphorylate and activate p38 
MAP kinase at Thr 130 and Tyr 182 , MKK4 isoforms which 
specifically phosphorylate and activate p3 8 MAP kinase at 
Thr lsc and Tyr 132 , and JNK at Thr 153 and Tyr 135 , and MKK7 
isoforms which specifically phosphorylate JNK az Thr 183 

5 and Tyr 185 . 

An " MKK7 " is a mammalian isoform of mitogen- 
activated protein kinase kinase (MKK) polypeptide having 
serine, threonine, and tyrosine kinase activity, and 
phosphorylating mitogen-activated protein (MAP) kinase 

0 JNK but not p3 8 . 

The invention includes the specific p3 8 and JNK 
MKKs disclosed, as well as closely related MKKs which are 
identified and isolated by the use of probes or 
antibodies prepared from the polynucleotide and amino 

5 acid sequences disclosed for the MKKs of the invention. 
This can be done using standard techniques, e.g., by 
screening a genomic, cDNA, or combinatorial chemical 
library with a probe having all or a part of the nucleic 
acid sequences of the disclosed MKKs. The invention 

0 further includes synthetic polynucleotides having all or 
part of the amino acid sequence of the MKKs herein 
described. 

The term "polypeptide" means any chain of amino 
acids, regardless of length or post - translational 
5 modification (e.g., glycosylation or phosphorylation), 
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and includes natural proteins as well as synthetic or 
recombinant polypeptides and peptides. 

The term "substantially pure, " when referring to a 
polypeptide, means a polypeptide that is at lease 60%, by 
5 weight, free from the proteins and naturally-occurring 
organic molecules with which it is naturally associated. 
A substantially pure MKK polypeptide (e.g., human) is at 
least 75%, more preferably at least 90%, and most 
preferably at least 99%, by weight, MKK polypeptide. A 

10 substantially pure MKK can be obtained, for example, by 
extraction from a natural source; by expression of a 
recombinant nucleic acid encoding a MKK polypeptide, or 
by chemically synthesizing the protein. Purity can be 
measured by any appropriate method, e.g., column 

15 chromatography, polyacrylamide gel electrophoresis, or 
HPLC analysis. 

In one aspect, the invention features isolated 
polynucleotides which encode the MKKs of the invention. 
In one embodiment, the polynucleotide is the nucleotide 

2 0 sequence of SEQ ID NO:l. In other embodiments, the 

polynucleotide is the nucleotide sequence of SEQ ID NO: 3, 
SEQ ID NO:5, SEQ ID NO : 7 , SEQ ID NO : 9 , SEQ ID NO:17, SEQ 
ID NO:19, SEQ ID NO : 2 5 , SEQ ID NO : 2 7 , SEQ ID NO : 2 9 , or 
SEQ ID NO:31, respectively. 
25 As used herein, "polynucleotide 11 refers to a 

nucleic acid sequence of deoxyribonucleot ides or 
ribonucleotides in the form of a separate fragment or a 
component of a larger construct. DNA encoding portions 
or all of the polypeptides of the invention can be 

3 0 assembled from cDNA fragments or from oligonucleotides 

that provide a synthetic gene which can be expressed in a 
recombinant transcriptional unit. Polynucleotide 
sequences of the invention include DNA, RNA, and cDNA 
sequences, and can be derived from natural sources or 
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synthetic sequences synthesized by methods known to the 
art . 

An "isolated" polynucleotide is a nucleic acid 
molecule that is separated in some way from sequences in 
5 the naturally occurring genome cf an organism. Thus, the 
term "isolated polynucleotide" includes any nucleic acid 
molecules that are not naturally occuring. The term 
therefore includes, for example, a recombinant 
polynucleotide which is incorporated into a vector, into 
0 an autonomously replicating plasmid or virus, or into the 
genomic DNA of a prokaryote or eukaryote, or which exists 
as a separate molecule independent: of other sequences. 
It also includes a recombinant DNA which is part of a 
hybrid gene encoding additional polypeptide sequences. 

5 The isolated polynucleotide sequences of the 

invention also include polynucleotide sequences that 
hybridize under stringent conditions to the 
polynucleotide sequences specified herein. The term 
"stringent conditions" means hybridization conditions 

0 that guarantee specificity between hybridizing 

polynucleotide sequences, such as those described herein, 
or more stringent conditions. One skilled in the art can 
select posthybridizaticn washing conditions, including 
temperature and salt concentrations, which reduce the 

5 number of nonspecific hybridizations such that cnly 

highly complementary sequences are identified (Sambrook 
et al. (1989) in Molecular Cloning . 2d ed. ; Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, NY. 

The isolated polynucleotide sequences of the 

0 invention also include sequences complementary to the 
polynucleotides encoding MKK (antisense sequences) . 
Antisense nucleic acids are DNA or RNA molecules that are 
complementary to at least a portion of a specific mRNA 
molecule (Weintraub (1990) Scientific American 262:40). 

5 The invention includes all antisense polynucleotides that 
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inhibit production of MKK polypeptides. In the cell, the 
antisense nucleic acids hybridize to the corresponding 
mRNA, forming a double-stranded molecule. Antisense 
oligomers of about 15 nucleotides are preferred, since 
5 they are easily synthesized and introduced into a target 
MKK-producing cell. The use of antisense methods to 
inhibit the translation of genes is known in the art, and 
is described, e.g., in Marcus-Sakura Anal. Biochem. , 
172 :289 (1988) . 

0 In addition, ribozyme nucleotide sequences for MKK 

are included in the invention. Ribozymes are RNA 
molecules possessing the ability to specifically cleave 
other single- stranded RNA in a manner analogous :o DNA 
restriction endonucleases . Through the modification of 
nucleotide sequences encoding these RNAs , molecules can 
be engineered to recognize specific nucleotide sequences 
in an RNA molecule and cleave it (Cech (1988) J. Amer. 
Med. Assn. 260:3030). A major advantage of this approach 
is that, because they are sequence-specific, only mRNAs 
with particular sequences are inactivated. 

There are two basic types of ribozymes, 
t e trahyrnena - type (Hasselhoff (1988) Nature 334:555) and 
"hammerhead" -type . Te trahyrnena -type ribozymes recognize 
sequences which are four bases in length, while 
"hammerhead" -type ribozymes recognize base sequences Il- 
ls bases in length. The longer the sequence, the greater 
the likelihood that the sequence will occur exclusively 
in the target mRNA species. Consequently, hammerhead- 
type ribozymes are preferable to te trahyrnena -type 
ribozymes for inactivating a specific mRNA species, and 
18 -base recognition sequences are preferable to shorter 
recognition sequences . 

The MKK polypeptides can also be used to produce 
antibodies that are immunoreactive or bind epitopes of 
the MKK polypeptides. Accordingly, one aspect of the 
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invention features antibodies to the MKK polypeptides of 
the invention. The antibodies of the invention include 
polyclonal antibodies which include pooled monoclonal 
antibodies with different epitopic specificities, as well 
5 as distinct monoclonal antibody preparations. Monoclonal 
antibodies are made from antigen-containing fragments of 
the MKK polypeptide by methods known in the art (see, for 
example, Kohler et al . (1975) Nature 256:495). 

The term "antibody" as used herein includes intact 

10 molecules as well as fragments thereof, such as Fa, 

F(ab') 2 , and Fv, which are capable of binding an epitopic 
determinant. Antibodies that specifically bind MKK 
polypeptides can be prepared using intact polypeptides or 
fragments containing small peptides of interest as the 

15 immunizing antigen. The polypeptide or peptide used to 
immunize an animal can be derived from translated cDNA or 
chemically synthesized, and can be conjugated to a 
carrier protein, if desired. Commonly used carriers that 
are chemically coupled to peptides include bovine serum 

2 0 albumin and thyroglobulin . The coupled peptide is then 

used to immunize the animal (e.g., a mouse, a rat, or a 
rabbit) . 

A molecule (e.g., antibody) that "specifically 
binds" is one that binds to a particular polypeptide, 
25 e.g., MKK7 , but that does not substantially reccginze or 
bind to other molecules in a sample, e.g., a biological 
sample which includes MKK7 . References to constructs 
made of an antibody (or fragment thereof) coupled to a 
compound comprising a detectable marker include 

3 0 constructs made by any technique, including chemical 

means and recombinant techniques. 

The invention also features methods of identifying 
subjects at risk for MKK-mediated disorders by measuring 
activation of the MKK signal transduction pathway. 
3 5 Activation of the MKK signal transduction pathway can be 
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determined by measuring MKK synthesis; activation of MKK 
isoforms; activation of MKK substrates p3 8 or JNK 
isoforms; or activation of p3 8 and JNK substrates such as 
ATF2, ATFa , CRE-BPa , and c-Jun. The term 11 JNK" cr "JNK 
5 isoforms" includes JNK1 , JNK2 , and JNK3 . The term "MKK 
substrate" as used herein includes MKK substrates, as 
well as MKK substrate substrates, e.g., p38, JNK, ATF2 , 
and c-Jun. 

In one embodiment, activation of the MKK signal 
0 transduction pathway is determined by measuring 

activation of the appropriate MKK signal transduction 
pathway substrates (for example, selected from p3 8, JNK 
isoforms, ATF2 , ATFa, CRE-BPa, cr c-Jun). MKK activity 
is measured by the rate of substrate phosphorylation as 
5 determined by quantitation of the rate of labelled 

phosphorus (e.g., [ 32 ] P or [ 33 ] P) incorporation. This can 
also be measured using phosphoryiation-specif ic reagents, 
such as antibodies. The specificity of MKK substrate 
phosphorylation can be tested by measuring p3 8 
activation, JNK activation, or both, or by employing 
mutated p38 or JNK molecules that lack the sites for MKK 
phosphorylations. Altered phosphorylation of the 
substrate relative to control values indicates alteration 
of the MKK signal transduction pathway, and increased 
risk in a subject cf an MKK-mediated disorder. MKK 
activation of p38 and JNK can be detected in a coupled 
assay with the MKK signal transduction substrate ATF2 , or 
related compounds such as ATFa and CRE-BPa. Activation 
can also be detected with the substrate c-Jun. When ATF2 
is included in the assay, it is present as an intact 
protein or as a fragment of the intact protein, e.g., the 
activation domain (residues 1-109, or a portion thereof) . 
ATF2 is incubated with a test sample in which MKK 
activity is to be measured and [-y- 32 P]ATP, under 
conditions sufficient to allow the phosphorylation of 
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ATF2 . ATF2 is then isolated and the amount of 
phosphorylation quant itated. In a specific embodiment, 
ATF2 is isolated by immunoprecipitation, resolved by SDS- 
PAGE, and detected by autoradicgraphy . 
5 In another embodiment, activation of the MKK 

signal transduction pathway is determined by measuring 
the level of MKK expression in a test sample. In a 
specific embodiment, the level of MKK expression is 
measured by Western blot analysis. The proteins present 
10 in a sample are fractionated by gel electrophoresis, 
transferred to a membrane, and probed with labeled 
antibodies to MKK. In another specific embodiment, the 
level of MKK expression is measured by Northern blot 
analysis. Total cellular or polyadenylated [poly (A)*] 
15 mRNA is isolated from a test sample. The RNA is 

fractionated by electrophoresis and transferred to a 
membrane. The membrane is probed with labeled MKK cDNA. 
In another embodiment, MKK expression is measured by 
quantitative PGR applied to expressed mRNA. 
20 The MKKs of the invention are useful for screening 

reagents that modulate MKK activity. MKKs are activated 
by phosphorylation. Accordingly, in one aspect, the 
invention features methods for identifying a reagent 
which modulates MKK activity, by incubating MKK with the 
25 test reagent and measuring the effect of the test reagent 
on MKK synthesis, phosphorylation, function, or activity. 
In one embodiment, the test reagent is incubated with MKK 
and [ 32 ]P-ATP, and the rate of MKK phosphorylation 
determined, as described above. In another embodiment, 
30 the test reagent is incubated with a cell transfected 
with an MKK polynucleotide expression vector, and the 
effect of the test reagent on MKK transcription is 
measured by Northern blot analysis, as described above. 
In a further embodiment, the effect of the test reagent 
35 on MKK synthesis is measured by Western blot analysis 
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using an antibody no MKK. In still another embodiment , 
the effect of a reagent on MKK activity is measured by 
incubating MKK with the test reagent, [ 32 ]P-ATP, and a 
substrate in the MKK signal transduction pathway, 
5 including one or more of p38, JNK, and ATF2 . The rate of 
substrate phosphorylation is determined as described 
above . 

The term "modulation of MKK activity" includes 
inhibitory or stimulatory effects. 
10 The invention is particularly useful for screening 

reagents that inhibit MKK activity. Such reagents are 
useful for the treatment or prevention of MKK-mediated 
disorders, for example, inflammation and oxidative 
damage . 

15 The invention further features a method of 

treating a MKK-mediated disorder by administering to a 
subject in need thereof, an effective dose of a 
therapeutic reagent that inhibits the activity of MKK. 
An "MKK-mediated disorder" is a pathological 

20 condition resulting, at least in part, from excessive 
activation of an MKK signal transduction pathway. The 
MKK signal transduction pathways are activated by several 
factors, including inflammation and stress. MKK-mediated 
disorders include, for example, ischemic heart disease, 

25 burns due to heat or radiation (UV, X-ray, y, (3, etc.), 
kidney failure, liver damage due to oxidative stress or 
alcohol, respiratory distress syndrome, septic shock, 
rheumatoid arthritis, autoimmune disorders, and other 
types of inflammatory diseases. 

3 0 A "therapeutic reagent" any compound or molecule 

that achieves the desired effect on an MKK-mediated 
disorder when administered to a subject in need thereof, 

MKK-mediated disorders further include 
proliferative disorders, particularly disorders that are 
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stress-related. Examples of stress-related MKK-mediated 
proliferative disorders are psoriasis, acquired immune 
deficiency syndrome, malignancies of various tissues of 
the body, including malignancies of the skin, bone 
5 marrow, lung, liver, breast, gastrointestinal system, and 
genito-urinary tract. Preferably, therapeutic reagents 
inhibit the activity or expression of MKK inhibit cell 
growth or cause apoptosis. 

A therapeutic reagent that "inhibits MKK activity" 
10 interferes with a MKK-mediated signal transduction 

pathway. For example, a therapeutic reagent can alter 
the protein kinase activity of MKK, decrease the level of 
MKK transcription or translation, e.g., an antisense 
polynucleotide able to bind MKK mRNA, or suppress MKK 
15 phosphorylation of p38, JNK, or ATF2 , thus disrupting the 
MKK-mediated signal transduction pathway. Examples of 
such reagents include antibodies that bind specifically 
to MKK polypeptides, and fragments of MKK polypeptides 
that competitively inhibit MKK polypeptide activity. 
20 A therapeutic reagent that "enhances MKK activity" 

supplements a MKK-mediated signal transduction pathway. 
Examples of such reagents include the MKK polypeptides 
themselves, which can be administered in instances where 
Che MKK-mediated disorder is caused by under expression 
25 of the MKK polypeptide, or expression of a mutant MKK 
polypeptide. In addition, portions of DNA encoding an 
MKK polypeptide can be introduced into cells that under 
express an MKK polypeptide. 

A "therapeutically effective amount" is an amount 
3 0 of a reagent sufficient to decrease or prevent the 
symptoms associated with the MKK-mediated disorder. 

Therapeutic reagents for treatment of MKK-mediated 
disorders identified by the methods of the invention are 
administered to a subject in a number of ways known to 
35 the art, including parenterally by injection, infusion, 
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sustained-release injection or implant, intravenously, 
intraperitoneal ly , intramuscularly, subcutaneous ly, or 
transdermally . Epidermal disorders and disorders of the 
epithelial tissues are treated by topical application of 
5 the reagent. The reagent is mixed with other compounds 
to improve stability and efficiency of delivery (e.g., 
liposomes, preservatives, or dimethyl sulfoxide (DMSO) ) 
Polynucleotide sequences, including antisense sequences, 
can be therapeutically administered by techniques known 
0 to the art resulting in introduction into the cells of a 
subject suffering from the MKK-mediated disorder. These 
methods include the use of viral vectors (e.g., 
retrovirus, adenovirus, vaccinia virus, or herpes virus), 
colloid dispersions, and liposomes. 
5 The materials of the invention are ideally suited 

for the preparation of a kit for the detection of the 
level or activity of MKK. Accordingly, the invention 
features a kit comprising an antibody that binds MKK, or 
a nucleic acid probe that hybridizes to a MKK 
0 polynucleotide, and suitable buffers. The probe or 

monoclonal antibody can be labeled to detect binding to a 
MKK polynucleotide cr protein. In a preferred 
embodiment, the kit features a labeled antibody to MKK. 
Unless otherwise defined, all technical and 
5 scientific terms used herein have the same meaning as 
commonly understood by one of ordinary skill in the art 
to which this invention belongs. Although methods and 
materials similar or equivalent to those described herein 
can be used in the practice or testing of the present 
) invention, suitable methods and materials are described 
below. All publications, patent applications, patents, 
and other references mentioned herein are incorporated by 
reference in their entirety. In addition, the materials, 
methods, and examples are illustrative only and not 
> intended to be limiting. 
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Other features and advantages of the invention 
will be apparent: from the detailed description, and from 
the claims. 



Detailed Description 
5 The drawings will first be described. 

Drawings 

Fig. 1 is a comparison of the amino acid sequences 
of MKK3 (SEQ ID NO : 2 ) , MKK4 - at (SEQ ID NO:6), the human 
MAP kinase kinases MEK1 (SEQ ID NO: 11) and MEK2 (SEQ ID 

10 NO:12), and the yeast HOG1 MAP kinase kinase PBS2 (SEQ ID 
NO:13). Sequences were compared using the PILE-UP 
program (version 7.2; Wisconsin Genetics Computer Group). 
The protein sequences are presented in single letter code 
(A, Ala; C, Cys ; D, Asp; E, Glu; F, Phe ; G, Gly; H, His; 

15 I, lie; K, Lys; L, Leu; M , Met; N, Asn; P, Pro; Q, Gin; 
R, Arg; S, Ser; T, Thr; V, Val ; W, Trp, and Y, Tyr) . The 
PBS2 sequence is truncated at both the NH 2 - (<) and COOH- 
(>) termini. Gaps introduced into the sequences to 
optimize the alignment are illustrated by a dash. 

20 Identical residues are indicated by a period. The sites 
of activating phosphorylation in MEK are indicated by 
asterisks . 

Fig. 2A is a dendrogram showing the relationship 
between members of the human and yeast MAP kinase 

25 kinases. The dendrogram was created by the unweighted 
pair-group method with the use of arithmetic averages 
(PILE-UP program) . The human (hu) MAP kinase kinases 
MEK1, MEK2 , MKK3 , and MKK4 ; the Saccharomyces cerevisiae 
(sc) MAP kinase kinases PBS2 , MKK1 , and STE7 ; and the 

3 0 Saccharomyces pombe (sp) MAP kinase kinases WIS1 and BYR1 
are presented. 

Fig. 2B is a dendrogram showing the relationship 
between MKKs . The dendrogram was created as described 
for Fig. 2A. 
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Fig. 3 is a schematic representation of the ERK, 
p38, and JNK signal transduction pathways. MEK1 and MEK2 
are activators of the ERK subgroup of MAP kinase. MKK3 
and MKK4 are activators of the p3 8 MAP kinase. MKK4 is 
5 identified as an activator of both the p3 8 and JIZK 
subgroups of MAP kinase. 

Figs. 4A-4D are a representation of the nucleic 
acid (SEQ ID NO:l) and amino acid sequences (SEQ ID NO: 2) 
for MKK3 . 

0 Figs. 5A-5C are a representation of the nucleic 

acid (SEQ ID NO: 3) and amino acid sequences (SEQ ID NO: 4) 
for MKK6 . 

Figs. 6A-6F are a representation of the nucleic 
acid (SEQ ID NO: 5) and amino acid sequences (SEQ ID NO: 6) 
5 for MKK4a. 

Figs. 7A-7F are a representation of the nucleic 
acid (SEQ ID NO: 7) and amino acid sequences (SEQ ID NO : 8 ) 
for MKK4 (3 . 

Figs. 8A-8F are a representation of the nucleic 
D acid (SEQ ID NO: 9) and amino acid sequences (SEQ ID 
NO: 10) for MKK4 y . 

Fig. 9 is a representation of the deduced primary 
structure of MKK7 (SEQ ID NO: 18) compared with hep (SEQ 
ID NO:21) , the MAP kinase kinases MEK1 ( MKK1 ; SEQ ID 
5 NO:ll), MEK2 (MKK2 ; SEQ ID NO:12), MKK3 (SEQ IDNO;2), 
MKK4 y (SEQ ID NO:10), MKK5 (SEQ ID NO:22), and MKK6 (SEQ 
ID NO:4) using the PILE-UP program (version 7,2; 
Wisconsin Genetics Computer Group) . Gaps introduced into 
the sequences to optimize the alignment are illustrated 
) with a dash (-) . Identity is indicated with a dot (.) . 
The sites of activating phosphorylation of MAP kinase 
kinases (2, 27, 37, and 38) are indicated with asterisks 
(*) . 
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Figs. 10A-10D are a representation of the nucleic 
acid (SEQ ID NO: 17) and amino acid (SEQ ID NO: IS) 
sequences for MKK7 . 

Figs. 11A-11D are a representation of the nucleic 
5 acid (SEQ ID NO: 19) and amino acid (SEQ ID NO:2C) 
sequences of MKK7b. 

Figs. 12A-12B are a representation of the nucleic 
acid (SEQ ID NO: 25) and amino acid (SEQ ID NO: 26) 
sequences of human MKK7 . 
10 Figs. 13A-13D are a representation of the nucleic 

acid (SEQ ID NO:27) and amino acid (SEQ ID NO:28) 
sequences of murine MKK7c. 

Figs. 14A-14D are a representation of the nucleic 
acid (SEQ ID NO: 29) and amino acid (SEQ ID NO: 3 0) 
15 sequences of murine MKK7d. 

Figs. 15A-15D are a representation of the nucleic 
acid (SEQ ID NO: 31) and amino acid (SEQ ID NO: 32) 
sequences of murine MKK7e . 

Fig. 16A is a graph of data from a transfection 
20 assay in which cells were co- transf ected with AP-1 

reporter plasmid pTRE-Lucif erase with expression vectors 
for MKK4 , MKK7 , JNK1 , JNK1 (APF) , or control vector. 

Fig. 1SB is a graph of a transfection assay in 
which cells were co- transf ected with a GAL4-ATF2 fusion 
2 5 vector and an expression vector for MKK4 , MKK7 , JNK1 , 
JNK1 (APF) , or control vector. 

Human Mito aen-Act ivated Protein Kinase Kinases 

The human MAP kinase kinases MKK3 and MKK4 
(MKK3/4), and MKK7 , described herein mediate the 
30 transduction of specific signals from the cell surface to 
the nucleus along specific pathways. These signal 
transduction pathways are initiated by factors such as 
cytokines, UV radiation, osmotic shock, and oxidative 
stress. Activation of MKK3/4, MKK6 , and MKK7 results in 
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activation of the MAP kinases. p38 is activated by MKK3 
and MKK4. JNK is activated by MKK4 and MKK7 . p38 and JNK 
in turn activate a group of related transcription factors 
such as ATF2 , ATFa , and CRE-BPa. These transcription 
5 factors in turn activate expression of specific genes. 
For example, ATF2 in known to activate expression of 
human T cell leukemia virus 1 (Wagner and Green (1993) 
Science 262:395), transforming growth factor-b2 (Kim et 
al. (1992) supra), interferon-/? (Du et al . (1993) Cell 
0 74:887), and E-seiectin (DeLuca et al . (1994) J. Biol. 
Chem. 269:19193). In addition, ATF2 is implicated in the 
function of a T ceil -specif ic enhancer (Georgopculos et 
al. (1992) Mol. Cell. Biol. 12:747). 

The JNK group of MAP kinases is activated by 
5 exposure of cells to environmental stress or by treatment 
of cells with pro- inflammatory cytokines (Gupta et al . 

(1994) EMBO J. 15:2760-2770; Derijard et al . (1991) Cell 
76:1025-1037; Kyriakis et ai . (1994) Nature 369:156-160; 
Sluss et al . (1994) Mol. Cell. Biol. 14:8376-8384; 
Kallunki et al . (1994) Genes 5c Dev. 8:2996-3007). 
Targets of the JNK signal transduction pathway include 
the transcription factors ATF2 and c-jun (Whitmarsh & 
Davis (1996) J. Mol. Med. 74:589-607). These 
transcription factors are members of the bZIP group that 
bind as homo- and hetero-dimeric complexes to AP-1 and 
AP-l-like sites in the promoters of many genes (Curran & 
Franza (1988) Cell 55:395-397) . JNK binds to an NH 2 - 
terminal region of ATF2 and c-Jun and phosphorylates two 
sites within the activation domain of each transcription 
factor (Derijard et al . (1994) Cell 76:1025-1037; van Dam 
et ai. (1995) EMBO J. 14:1798-1811; Livingstone et al . 

(1995) EMBO J. 14:1785-1797). This phosphorylation leads 
to increased transcriptional activity (Whitmarsh, supra) . 
Together, these biochemical studies indicate that the JNK 
signal transduction pathway contributes to the regulation 
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of AP-1 transcriptional activity in response to cytokines 
and environmental stress (Whitmarsh, supra) . Strong 
support for this hypothesis is provided by genetic 
evidence indicating that the JNK signaling pathway is 
5 required for the normal regulation of AP-1 

transcriptional activity (Yang ez al . (1997) Proc . Natl. 
Acad. Sci. USA, 94:3004-3009). 

JNK is activated by dual phosphorylation on Thr- 
183 and Tyr-185 (Derijard, supra) . MKK4 (also known as 
0 SEKI) was the first MAP kinase kinase identified as a 
component of the JNK signal transduction pathway 
(Derijard et al . 1995) Science 267:682-685/ Lin et al . 
(1995) Science 268:286-290; Sanchez et al . (1994) Nature 
372:794-798). Biochemical studies demonstrate that MKK4 
5 phosphorylates and activates JNK (Derijard et al . (1995) 
Science 267:682-685; Lin et al . (1995) Science 268:286- 
290; Sanchez et al . (1994) Nature 372:794-798). However, 
the function of MKK4 may not be restricted to the JNK 
signal transduction pathway because MKK4 also 
phosphorylates and activates p38 MAP kinase (Derijard et 
al. (1995) Science 267:682-685; Lin et ai . (1995) Science 
268:286-290). This specificity of MKK4 to activate both 
JNK and p3 8 MAP kinase provides a mechanism that may 
account for the co-ordinate activation of these MAP 
kinases in cells treated with cytokines or environmental 
stress (Davis (1994) Trends Biochem. Sci. 19:470-473). 
However, this co-ordinate activation is not always 
observed. For example, JNK activation in the liver 
correlates with decreased p38 MAP kinase activity 
(Mendelson et al. (1996) Proc. Natl. Acad. Sci. USA 
93:12908-12913). These data suggest that the properties 
of MKK4 are insufficient to account for the regulation of 
JNK in vivo. 

The isolation of human MKKs is described in 
Example 1, Example 22, Derijard et al . ((1995) Science 
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267:682-685, hereby specifically incorporated by 
reference), and Raingeaud et al. ((1995) Mol . Cell. Biol. 
16:1247-1255). Distinctive regions of the yeast PBS2 
sequence were used to design polymerase chain reaction 
5 (PCR) primers. Amplification of human brain mRNA with 
these primers resulted in the formation of specific 
products which were cloned into a plasmid vector and 
sequenced. Two different complementary DNAs (cDNAs) that 
encoded human protein kinases were identified: one 
10 encoding a 36 kD protein (MKK3) , and one encoding a 44 kD 
protein (MKK4) . MKK4 includes 3 isoforms that vary 
slightly at the NH 2 - terminal , identified as a, (3, and 7. 
The amino acid sequences of MKK3 (SEQ ID NO : 2 ) , MKK4 - a 
(SEQ ID NO:6), MKK4-/3 (SEQ ID NO:8), and MKK4-7 (SEQ ID 
15 NO: 10) are shown in Fig. 1. The nucleic acid and amino 
acid sequences of MKK3 (Fig. 4), MKK6 (Fig. 5), MKK4 -a 
(Fig. 6), MKK4-/3 (Fig. 7), and MKK4-T (Fig. 8) are also 
provided. MKK6 was isolated from a human skeletal muscle 
library by cross -hybridization with MKK3 . Except for 
20 differences at the N- terminus, MKK6 is highly homologous 
to MKK3 . Other human MKK3 and MKK4 isoforms that: exist 
can be identified by the method described in Example 1. 

The expression of these human MKK isoforms was 
examined by Northern (RNA) blot analysis of mRNA isolated 
25 from eight adult human tissues (Example 2) . Both protein 
kinases were found to be widely expressed in human 
tissues, with the highest expression seen in skeletal 
muscle tissue. 

The substrate specificity of MKK3 was investigated 
3 0 in an in vitro phosphorylation assay with recombinant 
epitope- tagged MAP kinases (JNK1, p3 8, and ERK2) as 
substrates (Example 3) . MKK3 phosphorylated p3 8, but did 
not phosphorylate JNK1 or ERK2 . Phosphoaminoacid 
analysis of p3 8 demonstrated the presence of a 
35 phosphothreonine and phosphotyrosine . Mutational 
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analysis of p3 8 demonstrated that replacement of 
phosphorylation sites Thr 180 and Tyr 182 with Ala and Phe, 
respectively, blocked p3 8 phosphorylation. These results 
establish that MKK3 functions in vitro as a p3 8 MAP 

5 kinase kinase. 

Studies of the in vitro substrate specificity of 
MKK4 are described in Example 4. MKK4 incubated with [-y- 
32 P]ATP, and JNK1 , p3 8, or ERK2 was found to phosphorylate 
both p38 and JNK1 . MKK4 activation of JNK and p38 was 

0 also studied by incubating MKK4 with wild- type or mutated 
JNK1 or p38. The p38 substrate ATF2 was included in each 
assay. MKK4 was found to exhibit less 

autophosphcrylation than MKK3 . MKK4 was also found to be 
a substrate for activated MAP kinase. Unlike MKK3 , MKK4 
5 was also found to activate JNK1 . MKK4 incubated with 
wild-type JNK1 , but not mutated JNK1 , resulted in 
increased phosphorylation of ATF2 . These results 
establish that MKK4 is a p3 8 MAP kinase kinase that also 
phosphorylates the JNK subgroup of MAP kinases. 

In vivo activation of p38 by UV-stimulated MKK3 is 
described in Example 5. Cells expressing MKK3 were 
exposed in the presence or absence of UV radiation. MKK3 
was isolated by immunoprecipitation and used for protein 
kinase assays with the substrates p38 or JNK. ATF2 was 
included in some assays as a substrate for p38 and JNK. 
MKK3 from non-activated cultured COS cells caused a small 
amount of phosphorylation of p3 8 MAP kinase, resulting 
from basal activity of MKK3 . MKK3 from UV- irradiated 
cells caused increased phosphorylation of p38 MAP kinase, 
but not of JNK1 . An increase in p3 8 activity was also 
detected in assays in which ATF2 was included as a 
substrate. These results establish that MKK3 is 
activated by UV radiation. 

The effect of expression of MKK3 and MKK4 on p3 8 
activity was examined in COS-1 cells (Example 6) . Cells 
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were transfected with a vector encoding p3 8 and a MEK1, 
MKK3, or MKK4 . Some of the cells were also exposed to 
EGF or UV radiation. p3 8 was isolated by 
immunoprecipitation and assayed for activity with [7- 
5 32 P]ATP and ATF2 . The expression of the ERK activator 
MEK1 did not alter p38 phosphorylation of ATF2 . In 
contrast, expression of MKK3 or MKK4 caused increased 
activity of p38 MAP kinase. The activation of p38 caused 
by MKK3 and MKK4 was similar to that observed in UV- 

10 irradiated cells, and was much greater than that detected 
in EGF- treated cells. These in vitro results provide 
evidence that MKK3 and MKK4 activate p3 8 in vivc. 

A series of experiments was conducted to examine 
the potential regulation of ATF2 by JNK1 . These 

15 experiments are described in Gupta et al . (1995) Science 
267:389-393, hereby specifically incorporated by 
reference. The effect of UV radiation on ATF2 
phosphorylation was investigated in COS-1 cells 
transfected with and without epitope- tagged JNK1 (Example 

2 0 7) . Cells were exposed to UV radiation, and JNK1 and 

JNK2 visualized by in-gel protein kinase assay with the 
substrate ATF2 . JNK1 and JNK2 were detected in 
transfected and non- transfected ceils exposed tc UV 
radiation; however, JNK1 levels were higher in the 
25 transfected cells. These results demonstrate that ATF2 
is a substrate for the JNK1 and JNK2 protein kinases, and 
that these protein kinases are activated in cells exposed 
to UV light. 

The site of JNK1 phosphorylation of ATF2 was 

3 0 examined by deletion analysis (Example 8) . Progressive 

NH 2 - terminal domain deletion GST-ATF2 fusion proteins were 
generated, and phosphorylation by JNK1 isolated from Un- 
irradiated cells was examined. The results showed that 
JNK1 requires the presence of ATF2 residues 1-60 for 
35 phosphorylation of the NH 2 -terminal domain of ATF2 . 
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The ATF2 residues required for binding of JNK1 
were similarly examined. JNK1 was incubated with 
immobilized ATF2 , unbound JNK1 was removed by extensive 
washing, and bound JNK1 was detected by incubation with 

5 [y- 32 p]ATP. Results indicate that residues 20 to 60 of 
ATF2 are required for binding and phosphorylation by 
JNK1 . A similar binding interaction between ATF2 and the 
55 kD JNK2 protein kinase has also been observed. 

Phosphorylation by JNK1 was shown to reduce the 

0 electrophoretic mobility of ATF2 (Example 9) . 

Phosphoamino acid analysis of the full-length ATF2 
molecule (residues 1-505} demonstrated that JNK 
phosphorylated both Thr and Ser residues. The major 
sites of Thr and Ser phosphorylation were located in the 

5 NH 2 and COOH terminal domains, respectively. The NH 2 - 

terminal sites of phosphorylation were identified as Thr 69 
and Thr 71 by phosphopept ide mapping and mutational 
analysis. These sites of Thr phosphorylation are located 
in a region of ATF2 that is distinct from the sub-domain 

D required for JNK binding (residues 20 to 60) . 

The reduced electrophoretic mobility seen with 
phosphorylation of ATF2 was investigated further (Examole 
10) . JNK1 was activated in CHO cells expressing JNK1 by 
treatment with UV radiation, pro- inflammatory cytokine 

\ interleukin-1 (IL-1) , or serum. A decreased 

electrophoretic mobility of JNK1 -activated ATF2 was 
observed in cells treated with UV radiation and IL-1. 
Smaller effects were seen after treatment of cells with 
serum. These results indicate that ATF2 is an in vivo 

) substrate for JNK1 . 

The effect of UV radiation on the properties of 
wild-type (Thr 63 ' 71 ) and phosphorylation-def ective (Ala 69 - 71 ) 
ATF2 molecules was investigated (Example 11) . Exposure 
to UV caused a decrease in the electrophoretic mobility 

5 of both endogenous and over-expressed wild-type ATF2 . 
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This change in electrophoretic mobility was associated 
with increased ATF2 phosphorylation. Both the 
electrophoretic mobility shift and increased 
phosphorylation were blocked by the replacement of Thr 69 
5 and Thr 71 with Ala in ATF2 . This mutation also blocked 
the phosphorylation of ATF2 on Thr residues in vivo. 

Transcriptional activities of fusion proteins 
consisting of the GAL 4 DNA binding domain and wild- type 
or mutant ATF2 were examined (Example 12) . Point 
0 mutations at Thr 69 and/or Thr 71 of ATF2 significantly 
decreased the transcriptional activity of ATF2 relative 
to the wild- type molecule, indicating the physiological 
relevance cf phosphorylation at these sites for activity. 

The binding of JNK1 to the NH 2 - terminal activation 
5 domain of ATF2 (described in Example 8) suggested that a 
catalytically inactive JNK1 molecule could function as a 
dominant inhibitor of the wild-type JNK1 molecule. This 
hypothesis was investigated by examining the effect of a 
catalytically inactive JNK1 molecule on ATF2 function 
(Example 13) . A catalytically- inactive JNK1 mutant was 
constructed by replacing the sites of activating Thr 133 
and Tyr 165 phosphorylation with Ala and Phe , respectively 
(Ala 133 , Phe :35 , termed ■■ dominant -negative" ) . Expression of 
wild-type JNK1 caused a small increase in serum- 
stimulated ATF2 transcriptional activity. In contrast, 
dominant -negative JNK1 inhibited both control and serum- 
stimulated ATF2 activity. This inhibitory effect results 
from the non-productive binding of the JNK1 mutant to the 
ATF2 activation domain, effectively blocking ATF2 
phosphorylation. 

The tumor suppressor gene product Rb binds to ATF2 
and increases ATF2- stimulated gene expression (Kim et al . 
(1992) Nature 358:331). Similarly, the adenovirus 
oncoprotein E1A associates with the DNA binding domain of 
ATF2 and increases ATF2- stimulated gene expression by a 
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mechanism that requires the NH 2 - terminal activation domain 
of ATF2 (Liu and Green (1994) Nature 368:520). ATF2 
transcriptional activity was investigated with the 
lucif erase reporter gene system in control, Rb-treated, 
5 and E1A- treated cells expressing wild- type or mutant ATF2 
molecules (Example 14) . Rb and E1A were found to 
increase ATF2 -stimulated gene expression of both wild- 
type and mutant ATF2 . However, mutant ATF2 caused a 
lower level of reporter gene expression than did wild- 
10 type ATF2 . Together, these results indicate a 

requirement for ATF2 phosphorylation (on Thr 69 and Thr 71 ) 
plus either Rb or E1A for maximal transcriptional 
activity. Thus, Rb and E1A act in concert with ATF2 
phosphorylation to control transcriptional activity. 

15 A series of experiments were conducted to examine 

the action of p3 8 activation and to establish the 
relationship of the p3 8 MAP kinase pathway to the ERK and 
JNK signal transduction pathways (Raingeaud et ai . (1995) 
J. Biol. Chem. 270:7420, hereby specifically incorporated 

20 by reference) . Initially, the substrate specificity of 
p3 8 was investigated by incubating p3 8 with proteins that 
have been demonstrated to be substrates for the ERK 
and/or JNK groups of MAP kinases (Example 15) . We 
examined the phosphorylation of MBP (Erickson et al . 

25 (1990) J. Biol. Chem. 265:19728), EGF-R (Northwood et al . 
(1991) J. Biol. Chem. 266:15266), cytoplasmic 
phospholipase A 2 (cPLA 2 ) (Lin et al . (1993) Cell 72:269), 
c-Myc (Alvarez et al . (1991) J. Biol. Chem. 266:15277), 
I kB, c-Jun, and wild-type (Thr 69 ' 71 ) or mutated (Ala 69 - 71 ) 

30 ATF2 . p38 phosphorylated MBP and EGF-R, and to a lesser 
extent IkB, but not the other ERK substrates, 
demonstrating that the substrate specificity of p38 
differs from both the ERK and JNK groups of MAP kinases. 
Wild-type ATF2 , but not mutated ATF2 (Ala 69 ' 71 ), was found 

35 to be an excellent p38 substrate. 
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The phosphorylation of ATF2 by p3 8 was associated 
with an electrcphoretic mobility shift of ATF2 during 
polyacrylamide gel electrophoresis. We tested the 
hypothesis that p3 8 phosphorylates ATF2 at the same sites 
5 as JNK1 by replacing Thr 69 and Thr 71 with Ala (Ala 59 ' 71 ) . it 
was found that p3 8 did not phosphorylate mutated ATF2 , 
which demonstrates chat p3 8 phosphorylates ATF2 within 
the NH 2 - terminal activation domain on Thr 69 and Thr 71 . 

A comparison of the binding of JNK and p3 8 to ATF2 
0 was conducted by incubating extracts of cells expressing 
JNK1 or p3 8 with epitope alone (GST) or GST-ATF2 
(residues 1-109 containing the activation domain) 
(Example 16) . Bound protein kinases were detected by 
Western blot analysis. The results demonstrate that both 
5 p38 and JNK bind to the ATF2 activation domain. 

EGF and phorbol ester are potent activators of the 
ERK signal transduction pathway (Egan and Weinberg (1993) 
Nature 365:781), causing maximal activation of the ERK 
sub-group of MAP kinases. These treatments, however, 
cause only a small increase in JNK protein kinase 
activity (Derijard et ai . (1994) supra; Hibi et al . 
(1993) supra) . The effects of EGF or phorbol esters, as 
well UV radiation, osmotic shock, mterleukin- 1 , tumor 
necrosis factor, and LPS, on p38 activity were all tested 
(Example 17) . Significantly, EGF and phorbol ester 
caused only a modest increase in p38 protein kinase 
activity, whereas environmental stress (UV radiation and 
osmotic shock) caused a marked increase in the activity 
of both p3 8 and JNK. Both p3 8 and JNK were activated in 
cells treated with pro- inflammatory cytokines (TNF and 
IL-1) or endotoxic LPS. Together, these results indicate 
that p38, like JNK, is activated by a stress-induced 
signal transduction pathway. 

ERKs and JNKs are activated by dual 
phosphorylation within the motifs Thr-Glu-Tyr and Thr- 
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Pro-Tyr, respectively. In contrast, p3 8 contains the 
related sequence Thr-Gly-Tyr. To test whether this motif 
is relevant to the activation of p38, the effect of the 
replacement of Thr-Gly-Tyr with Ala-Gly-Phe was examined 
5 (Example 18) . The effect of UV radiation on cells 
expressing wild- type (Thr 120 , Tyr 132 ) or mutant p3 8 
(Ala 180 , Phe 182 ) was studied. Western blot analysis using an 
anti-phosphotyrosme antibody demonstrated that exposure 
to UV radiation caused an increase in the Tyr 
0 phosphorylation of p38. The increased Tyr 

phosphorylation was confirmed by phosphoamino acid 
analysis of p38 isolated from [7 - 32 P] phosphate- labeled 
cells. This analysis also demonstrated that UV radiation 
caused increased Thr phosphorylation of p38. 
5 Significantly, the increased phosphorylation on Thr 180 and 
Tyr 132 was blocked by the Ala 180 /Phe 182 mutation. This 
result demonstrates that UV radiation causes increased 
activation of p3 8 by dual phosphorylation. 

It has recently been demonstrated that ERK 
0 activity is regulated by the mitogen- induced dual 
specificity phosphatases MKP1 and PAC1 (Ward et ai . 
(1994) Nature 367:651). The activation of p38 by dual 
phosphorylation (Example 18) raises the possibility that 
p38 may also be regulated by dual specificity 
5 phosphatases. We examined the effect of MKP1 and PAC1 on 
p38 MAP kinase activation (Example 19) . Cells expressing 
human MKP1 and PAC1 were treated with and without UV 
radiation, and p3 8 activity measured. The expression of 
PAC1 or MKP1 was found to inhibit p3 8 activity. The 
0 inhibitory effect of MKP1 was greater than PAC1 . In 

contrast, cells transfected with a catalytically inactive 
mutant phosphatase (mutant PAC1 Cys 257 /Ser) did not 
inhibit p3 8 MAP kinase. These results demonstrate that 
p38 can be regulated by dual specificity phosphatases 
5 PAC1 and MKP1 . 
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The sub-cellular distribution of p38 MAP kinase 
was examined by indirect immunofluorescence microscopy 
(Example 20) . Epitope- tagged p3 3 MAP kinase was detected 
using the M2 monoclonal antibody. Specific staining of 

5 cells transfected with epitope-cagged p38 MAP kinase was 
observed at the cell surface, in the cytoplasm, and in 
the nucleus. Marked changes in cell surface and nuclear 
p3 8 MAP kinase were not observed following UV 
irradiation, but an increase in the localization of 

0 cytoplasmic p3 8 MAP kinase to the perinuclear region was 
detected. 

A series of experiments were conducted to study 
the activation of JNK by hyper-csmotic media (Example 
21) . These experiments were reported by Galcheva-Gargova 
5 et al. (1994) Science 265:806, hereby specifically 

incorporated by reference. CHO cells expressing epitope- 
tagged JNK1 were incubated with 0 - 1000 mM sorbitol, and 
JNK1 activity measured in an immune complex kinase assay 
with the substrate c-Jun. Increased JNK1 activicy was 
observed in cells incubated for 1 hour with 100 mM 
sorbitol. Increased JNK1 activity was observed within 5 
minutes of exposure to 300 mM sorbitol. Maximal activity 
was observed 15 to 3 0 minutes after osmotic shock with a 
progressive decline in JNK1 activity at later times. The 
activation of JNK by osmotic shock was studied in cells 
expressing wild- type (Thr 183 , Tyr 135 ) or mutated (Ala 133 , 
Phe 135 ) JNK1. JNK1 activity was measured after incubation 
for 15 minutes with or without 3 00 mM sorbitol. Cells 
expressing wild-type JNK1 showed increased JNK1 activity, 
while cells expressing mutated JNK1 did not. These 
results demonstrate that the JNK signal transduction 
pathway is activated in cultured mammalian cells exposed 
to hyper-osmotic media. 

The results of the above-described experiments are 
illustrated in Fig. 3, which diagrams the ERK, p3 8, and 
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JNK MAP kinase signal transduction pathways. ERKs are 
potently activated by treatment of cells with EGF or 
phorbol esters. In contrast, p38 is only slighcly 
activated under these conditions (Example 15) . However, 
5 UV radiation, osmotic stress, and inflammatory cytokines 
cause a marked increase in p3 8 activity. This difference 
in the pattern of activation of ERK and p38 suggests that 
these MAP kinases are regulated by different signal 
transduction pathways. The molecular basis for the 
0 separate identity of these signal transduction pathways 
is established by the demonstration that the MAP kinase 
kinases that activate ERK (MEK1 and MEK2 ) and p3 8 (MKK3 , 
MKK4 , and MKK6) are distinct. 

The isolation of murine and human MKK7 is 
5 described in Example 22. Distinctive regions of the 
Drosophila MAP kinase kinase hep sequence were used to 
design polymerase chain reaction (PCR) primers. 
Amplification of murine testis mRNA with these primers 
resulted in the formation of specific products which were 
cloned into a plasmid vector and sequenced. One sequence 
related to hep was identified and used to screen a murine 
testis library. Five DNAs (cDNAs) that encoded protein 
kinases were identified: one encoding a MAP protein 
kinase kinase (MKK7) . The others encoded various splice 
variants: MKK7b (a partial sequence appears in Fig. 11), 
MKK7c (Fig. 13), MKK7d (Fig. 14), MKK7e (Fig. 15). The 
deduced amino acid sequences of MKK7 (SEQ ID NO: 18) and 
hep (SEQ ID NO: 21) are shown in Fig. 9, and compared to 
the MAP kinase kinases MEK1 (SEQ ID NO:ll), MEK2 (SEQ ID 
NO:12), MKK3 (SEQ ID NO : 2 ) , MKK4 (SEQ ID NO:10), MKK5 
(SEQ ID NO: 22) , and MKK6 (SEQ ID NO : 4 ) . A human MKK7 was 
identified by screening a human cDNA library with a full- 
length (mouse) MKK7 cDNA probe. The identified partial 
sequence (lacking the 3 'end) is homologous to mouse 
MKK7c. 
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The expression of MKK7 and MKK4 isoforms was 
examined by Northern (RNA) blot analysis of poly A+ mRNA 
isolated from eight murine tissues (Example 23) . Both 
protein kinases were found to be widely expressed. 
5 The substrate specificity of MKK7 was investigated 

in an in vitro phosphorylation assay with recombinant, 
epitope- tagged MAP kinases (JNK1, p3 8, and ERK2 } as 
substrates (Example 24) . MKK7 phosphorylated JNK, but 
did not phosphorylate p3 8 or ERK2 . MKK7 was 
0 phosphorylated by p38 and JNK1 . 

MKK7 was found to specifically activate JNK 
protein kinase in vivo (Example 25) . CHO cells were co- 
transfected with an epitope- tagged MAP kinase (JNK1, p38, 
or ERK2) together with an empty expression vector or an 
5 expression vector encoding MKK1 , MKK4 , MKK6, or MKK7 and 
the product of the phosphorylation reaction analyzed. 
MKK7 activated only JNK1 , and did so to a greater extent 
than did MKK4 . 

To test whether MKK7 could cause increased AP-1 
transcriptional activity, a co- transf ection assay was 
employed (Example 26) . Co-expression of MKK7 with JNK 
caused an increase in AP-1 reporter gene expression that 
was greater than the increase seen with MKK4 and JNK. A 
similar result was seen when ATF2 was used as the 
reporter gene. In addition, MKK7 alone was able to 
increase expression of ATF2 (Fig. 16) . 

MKK isoforms are useful for screening reagents 
which modulate MKK activity. Described in the Use 
section following the Examples are methods for 
identifying reagents capable of inhibiting or activating 
MKK activity. 

The discovery of human MKK isoforms and MKK- 
mediated signal transduction pathways is clinically 
significant for the treatment of MKK-mediated disorders. 
One use of the MKK isoforms is in a method for screening 
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reagents able to inhibit or prevent the activation of the 
MKK-MAP kinase- ATF2 pathways. 

EXAMPLES 

The following examples are meant to illustrate, 
5 not limit, the invention. 

Example 1 . MKK Protein Kinases 

The primary sequences of MKK3 and MKK4 were 
deduced from the sequence of cDNA clones isolated from a 
human fetal brain library. 
0 The primers TTYTAYGGNGCNTTYTTYATHGA (SEQ ID NO: 14) 

and ATBCTYTCNGGNGCCATKTA (SEQ ID NO: 15) were designed 
based on the sequence of PBS2 (Brewster et al . (1993) 
Science 259:1760/ Maeda et al . (1994) Nature 369:242). 
The primers were used in a PCR reaction with human brain 
5 mRNA as template. Two sequences that encoded fragments 
of PBS2 -related protein kinases were identified. Full- 
length human cDNA clones were isolated by screening of a 
human fetal brain library (Derijard et al . (1995) Science 
267:682-685). The cDNA clones were examined by 
sequencing with an Applied Biosystems model 3 73A machine. 
The largest clones obtained for MKK3 (2030 base oairs 
(bp)) and MKK4 .3576 bp) contained the entire coding 
region of these protein kinases. 

The primary structures of MKK3 (SEQ ID NO : 2 ) and 
MKK4 - a (SEQ ID NO : 6 ) are shown in Fig. 1. An in- frame 
termination codon is located in the 5' untranslated 
region of the MKK3 cDNA, but not in the 5' region of the 
MKK4 cDNA. The MKK4 protein sequence presented starts at 
the second in- frame initiation codon. 

These sequences were compared to those of the 
human MAP kinase kinases MEK1 (SEQ ID NO: 11) and MEK2 
(SEQ ID NO: 12) (Zheng and Guan (1993) J. Biol. Chem 
268:11435) and of the yeast MAP kinase kinase PBS2 (SEQ 
ID NO: 13) (Boguslawaski and Polazzi (1987) Proc . Natl. 
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Acad. Sci. USA 84:5848) (Fig. I). The identity and 
similarity of the kinases with human MKK3 (between 
subdomains I and XI) were calculated with the BESTFIT 
program (version 7.2; Wisconsin Genetics Computer Group) 
5 (percent of identity to percent of similarity) : MEK1, 
41%/63%; MEK2, 41%/62%; MKK4a, 52%/73%; and PBS2, 
40%/59%) . The identity and similarity of the kinases 
with human MKK4a were calculated to be as follows 
(percent of identity to percent of similarity) : MEK1 , 

10 44%/63%; MEK2 , 45%/61%; MKK3 , 52%/73%; and PBS2 , 44%/58%. 

The cDNA sequences of MKK3 and MKK4y have been 
deposited in GenBank with accession numbers L36719 and 
L36870, respectively. The MKK4 y cDNA sequence contains 
both the cDNA sequences of MKK4a and MKK4 (3 , which are 

15 generated in vivo from alternate splicing sites. One of 
ordinary skill in the art can readily determine the amino 
acid sequences of MKK3 and MKK4 isoforms from the 
deposited cDNA sequences . 

Example 2 . Expression of MKK3 and MKK4 mRNA in Adult 

2 0 Human Tissue 

Northern blot analysis was performed with 
polyadenylated [poiy(A) + ] mRNA { 2 fig) isolated from human 
heart, brain, placenta, lung, liver, muscle, kidney, and 
pancreas tissues. The mRNA was fractionated by 

2 5 denaturing agarose gel electrophoresis and was 

transferred to a nylon membrane. The blot was probed 
with the MKK3 and MKK4 cDNA labeled by random priming 
with [o?- 32 P]ATP (deoxyadenosine triphosphate) (Amersham 
International PLC) . MKK3 and MKK4 were expressed in all 

30 tissues examined/ the highest expression of MKK3 and MKK4 
was found in skeletal muscle tissue. 

The relation between members of the human and 
yeast MAP kinase kinase group is presented as a 



SUBSTITUTE SHEET (RULE 26) 



WO 99/02547 



PCT/US98/14101 



- 31 - 

dendrogram (Fig. 2) . MKK3/4 form a unique subgroup of 
human MAP kinase kinases . 

Example 3 . In Vitro Phosphorylation of r>3 8 MAP kinase hv 
MKK3 

5 GST-JNK1, and GST-ERK2 have been described 

(Derijard et al . (1994) supra; Gupta et al . (1995) 
Science 267:389; Wartmann and Davis (1994) J. Biol. Chem. 
269:6695, each herein specifically incorporated by 
reference) . GST-p3 8 MAP kinase was prepared from the 
10 expression vector pGSTag (Dressier et al . (1992) 

Biotechniques 13:866) and a PCR fragment containing the 
coding region of the p3 8 MAP kinase cDNA. GST-MXK3 and 
MKK4 were prepared with pGEX3X (Pharmacia-LKB 
Biotechnology) and PCR fragments containing the coding 

15 region of the MKK3 and MKK4 cDNAs . The GST fusion 

proteins were purified by affinity chromatography with 
the use of GSH-agarose (Smith and Johnson (1988) Gene 
67:31) . The expression vectors pCMV- Flag- JNK1 and pCMV- 
MEK1 have been described (Derijard et al . (1994) supra; 

20 Wartmann and Davis (1994) supra) . The plasmid pCMV-Flag- 
p3 8 MAP kinase was prepared with the expression vector 
pCMVS (Andersson et al . (1989) J. Biol. Chem. 264:8222) 
and rhe p3 8 MAP kinase cDNA . The expression vectors for 
MKK3 and MKK4 were prepared by subcloning of the cDNAs 

25 into the polylinker of pCDNA3 (Invitrogen) . The Flag 

epitope (Asp-Tyr-Lys-Asp-Asp~Asp-Asp-Lys (SEQ IDNO:16); 
Immunex, Seattle, WA) was inserted between codons 1 and 2 
of the kinases by insertional overlapping PCR (Ko et al . 
(1989) Gene 77 :51) . 

30 Protein kinase assays were performed in kinase 

buffer (25 mM 4- (2 -hydroxy ethyl ) -1- 
piperazineethansulf onic acid, pH 7.4, 25 mM /?- 
glycerophosphate, 2 5 mM MgCl 2/ 2 mM dithiothreitcl , and 
0.1 mM orthovanadate) . Recombinant GST-MKK3 was 
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incubated with [y- 32 P]ATP and buffer, GST-JNK1, GST-p38 
MAP kinase, or GST-ERK2. The assays were initiated by 
the addition of 1 /xg of substrate proteins and SO /zM [7- 
32 P]ATP (10 Ci/mmol) in a final volume of 25 fil . The 
5 reactions were terminated after 30 minutes at 25oc by 
addition of Laemmli sample buffer. The phosphorylation 
of the substrate proteins was examined after SDS- 
polyacrylamide gel electrophoresis (SDS-PAGE) by 
autoradiography. Phosphoaminoacid analysis was performed 
0 by partial acid hydrolysis and thin- layer chromatography 
(Derijard et al . (1994) supra; Alvarez et al . (1991) J. 
Biol. Chem. 266:15277). Autophosphorylation of MKK3 was 
observed in all groups. MKK3 phosphoryiated p38 MAP 
kinase, but not JNK1 or ERK2 . 

A similar insert ional overlapping PCR procedure 
was used to replace Thr 180 and Tyr 182 of p38, with Ala and 
Phe, respectively. The sequence of all plasmids was 
confirmed by automated sequencing on an Applied 
Biosystems model 373A machine. GST-MKK3 was incubated 
with [y- 32 P]ATP and buffer, wild-type GST-p38 MAP kinase 
(TGY) , or mutated GST-p38 MAP kinase (AGF) . The 
phosphoryiated proteins were resolved by SDS-PAG2 and 
detected by autoradiography. Only phosphorylation of 
wild-type p38 was observed. 

Example 4 . In Vitro Phosphorylation and Activation of 
JNK and o3 8 MAP Kinase bv MKK4 
Protein kinase assays were conducted as described 
in Example 3 . Recombinant GST-MKK4 was incubated with 
[y- 32 P]ATP and buffer, GST-JNK1, GST-p3 8 MAP kinase, or 
GST-ERK2. JNK1 and p38 were phosphoryiated, as was MKK4 
incubated with JNK1 and p3 8. 

GST-MKK4 was incubated with [y- 32 P]ATP and buffer, 
wild-type JNK1 (Thr 183 , Tyr 185 ) , or mutated GST-JNK1 (Ala 183 , 
Phe 185 ). The JNK1 substrate ATF2 (Gupta et al . (1995) 
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supra) was included in each incubation. ATF2 was 
phosphorylated in the presence cf MKK4 and wild- type 
JNK1. The results establish that MKK4 phosphoryiates and 
activates both p3 8 and JNK1 . 

5 Example 5 . Phosphorylation and Activation of c3 8 MAP 

Kinase by UV- stimulated MKK3 
Epitope- nagged MKK3 was expressed in COS-i cells 
maintained in Dulbecco's modified Eagle's medium 
supplemented with fetal bovine serum (5%) (Gibco-3RL) 
10 The cells were transfected with the lipof ect amine reagent 
according to the manufacturer's recommendations 'Gibco- 
BRL) and treated with UV radiation or EGF as described 
(Derijard et al . (1994) supra). 

The cells were exposed in the absence and presence 
15 of UV-C (40 J/m 2 ) . The cells were solubilized with lysis 
buffer (20 mM tris, pH 7.4, 1% TRITON® X-100, 10% 
glycerol, 13 7 mM NaCl , 2 mM EDTA, 2 5 mM 0- 
glycerophosphate , 1 mM Na orthovanadate , 1 mM 
phenylmethylsulf onyl fluoride, and leupeptin (10 fig /ml) ) 
20 and centrifuged at 100,000 x g for 15 minutes at 4oc. 
MKK3 was isolated by immunoprecipitation . The epitope- 
tagged protein kinases were incubated for 1 hour at 4°C 
with the M2 antibody to the Flag epitope (IBI -Kodak) 
bound to protein G-Sepharose ( Pharmacia-LKB 
25 Biotechnology) . The immunoprecipitates were washed twice 
with lysis buffer and twice with kinase buffer. 

Protein kinase assays were conducted with the 
substrate GST-p3 8 MAP kinase or JNK1 . ATF2 was included 
in some assays. Basal levels of MKK3 phosphorylation of 
30 p38 MAP kinase were observed. UV- irradiation resulted in 
increased phosphorylation of p3 8 MAP kinase, but not of 
JNK1. The increased p38 MAP kinase activity resulted in 
increased phosphorylation of ATF2 . 
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Example 6 . Activation of d38 MAP Kinase in Cells 
Expressing MKK3 and MKK4 
COS-1 cells were transfected with epitope- tagged 
p3 8 MAP kinase, together with an empty expression vector 
5 or an expression vector encoding MEK1 , MKK3 , or MKK4 a . 
Some of the cultures were exposed to UV radiation (40 
J/m 2 ) or treated with 10 nM EGF. p38 MAP kinase was 
isolated by immunoprecipitation with M2 monoclonal 
antibody, and the protein kinase activity was measured in 

10 the immunecomplex with [y- 32 P]ATP and ATF2 as substrates. 
The product of the phosphorylation reaction was 
visualized after SDS-PAGE by autoradiography. ATF2 was 
not phosphorylated in the control MEK1, or EGF- treated 
groups, but was phosphorylated in the MKK3 , MKK4 , and UV- 

15 irradiated groups. MKK3 and MKK4 phosphorylation of ATF2 
was similar to that seen with p3 8 MAP kinase isolated 
from UV-irradiated cells. 

Example 7. Phosphorylation of ATF2 by JNK1 and JNK2 

COS-1 cells were maintained in Dulbecco's modified 

20 Eagle's medium supplemented with bovine serum albumin 
(5%) (Gibco-BRL) . Metabolic labeling with [ 32 ] P was 
performed by incubation of cells for 3 hours in 
phosphate- free modified Eagle's medium (Flow Laboratories 
Inc.) supplemented with [ 32 P] orthophosphate (2 mCi/ml) 

25 (Dupont-NEN) . COS-1 cells were transfected without 
(Mock) and with epitope- tagged JNK1 (JNK1) . Plasmid 
expression vectors encoding the JNK1 cDNA have previously 
been described (Derijard et al . (1994) Cell 76:1025, 
herein specifically incorporated by reference) . Plasmid 

30 DNA was transfected into COS-1 cells by the lipof ectamine 
method (Gibco-BRL) . After 4 8 hours of incubation, some 
cultures were exposed to 4 0 J/m 2 UV radiation and 
incubated for 1 hour at 3 7oc. 
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Cells were lysed in 20 mM Tris, pH 7.5, 25 mM (3- 
glycerophosphate, 10% glycerol, 1% Triton® X-100, 0.5% 
(w/v) deoxycholate, 0.1% (w/v) SDS , 0.13 7 M NaCl , 2 mM 
pyrophosphate, 1 mM orthovanadate, 2 mM EDTA, 10 fig /ml 
5 leupeptin, 1 mM PMSF. Soluble extracts were prepared by 
centrif ugation in a microfuge for 2 0 minutes at 4oc. 
JNK1 immunoprecipitates were also prepared by reaction 
with a rabbit antiserum prepared with recombinant JNK1 as 
an antigen. 

0 In-gel protein kinase assays were performed with 

cell lysates and JNK1 immunoprecipitates after SDS -PAGE 
by renaturation of protein kinases, polymerization of the 
substrate (GST-ATF2 , residues 1-505) in the gel, and 
incubation with Ct- 32 P]ATP (Derijard et al . (1994) supra). 
5 The incorporation of [ 32 P] phosphate was visualized by 

autoradiography and quant it ated with a Phosphorimager and 
ImageQuant software (Molecular Dynamics Inc., Sunnyvale, 
CA) . The cell lysates demonstrate the presence of 46 kD 
and 55 kD protein kinases that phosphorylate ATF2 in 
0 extracts prepared from UV-irradiated cells. The 46 kD 
and 55 kD protein kinases were identified as JNK1 and 
JNK2 , respectively . 

Example 8. Binding of JNK1 to ATF2 and Phosphorylation 
of the NH. -Terminal Activation Domain 
The site of JNK1 phosphorylation of ATF2 was 
investigated by generation of progressive NH 2 -terminal 
domain deletions of ATF2 . Plasmid expression vectors 
encoding ATF2 (pECE-ATF2) (Liu and Green (1994) and 
(1990)), have been described. Bacterial expression 
vectors for GST-ATF2 fusion proteins were constructed by 
sub-cloning ATF2 cDNA fragments from a polymerase chain 
reaction (PCR) into pGEX-3X (Pharmacia-LKB Biotechnology 
Inc.). The sequence of all constructed plasmids was 
confirmed by automated sequencing with an Applied 
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Biosystems model 3 73A machine. The GST-ATF2 proteins 
were purified as described (Smith and Johnson (198 8) Gene 
67:31), resolved by SDS-PAGE and stained with Ccomassie 
blue. GST-ATF2 fusion proteins contained residues 1-505, 
5 1-349, 350-505, 1-109, 20-109, 40-109, and 60-109. 

The phosphorylation of GST-ATF2 fusion proteins by 
JNK1 isolated from UV- irradiated cells was examined in an 
immunocomplex kinase assay. Immunecomplex kinase assays 
were performed with Flag epitope- tagged JNK1 and the 
10 monoclonal antibody M2 (IBI -Kodak) as described by 

Derijard et al . (1994) supra). Immunecomplex protein 
kinase assays were also performed with a rabbit antiserum 
prepared with recombinant JNK1 as an antigen. The cells 
were solubilized with 20 mM Tris, pH 7.5, 10% glycerol, 
15 1% Triton® X-100, 0.137 M NaCl , 25 mM ^-glycerophosphate, 
2 mM EDTA, 1 mM orthovanadate , 2 mM pyrophosphate, 10 
/xg/ml leupeptin, and 1 mM PMSF. JNK1 was 
immunoprecipitated with protein G-Sepharose bound to a 
rabbit polyclonal antibody to JNK or the M2 monoclonal 

2 0 antibody to the Flag epitope. The beads were washed 

three times with lysis buffer and once with kinase buffer 
(20 mM Hepes, pH 7.6, 20 mM MgCl 2 , 25 mM /3- 
glycerophosphate, 10 0 ^M Na orthovanadate, 2 mM 
dithiothreitol) . The kinase assays were performed at 

25 25oc for 10 minutes with 1 /xg of substrate, 20 fiM 

adenosine triphosphate and 10 fiCi of [y- 32 P]ATP in 30/il of 
kinase buffer. The reactions were terminated with 
Laemmli sample buffer and the products were resolved by 
SDS-PAGE (10% gel) . JNK1 phosphorylates GST-ATF2 fusion 

30 proteins containing residues 1-505, 1-349, 1-109, 20-109, 
and 40-109, but not 60-109. These results indicate that 
the presence of ATF2 residues 1-60 are required for 
phosphorylation by JNK. 

The binding of immobilized GST-ATF2 fusion 

3 5 proteins was examined in a solid-phase kinase assay as 
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described by Hibi et al . ((1993) Genes Dev. 7:2135, 
herein specifically incorporated by reference) . JNK1 
from UV-irradiated cells was incubated with GST-ATF2 
fusion proteins bound to GSH-agarose. The agarose beads 

5 were washed extensively to remove the unbound JNK1 . 
Phosphorylation of the GST-ATF2 fusion proteins by the 
bound JKK1 protein kinase was examined by addition of [y- 
32 P]ATP. JNK1 bound GST-ATF2 fusion proteins containing 
residues 1-505, 1-349, 1-109, 20-109, and 40-109, 

0 indicating that the presence of residues 20-60 were 
required for binding of JNK1 to ATF2 . 

Example 9. Phosphorylation of the NH , - terminal 

Activation Domain of ATF2 on Thr 69 and Thr 71 
by JNK1 

5 The effect of UV radiation on the properties of 

wild-type (Thr 65 ' 71 ) and phosphorylat ion-defective (Ala 69 ' 71 ) 
ATF2 molecules was examined. Mock- transf ected and JNK1- 
transfected COS cells were treated without and with 40 
J/m 2 UV radiation. The epitope-cagged JNK1 was isolated 

) by immunoprecipitation with the M2 monoclonal antibody. 
The phosphorylation of GST-ATF2 (residues 1 to 109) was 
examined in an immuno complex kinase assay as described 
above. The GST-ATF2 was resolved from other proteins by 
SDS-PAGE and stained with Coomassie blue. The 

: phosphorylation of GST-ATF2 was detected by 

autoradiography. JNK1 - transf ected cells, but not mock- 
transf ected cells, phosphorylated ATF2 . JNK1 
phosphorylation of ATF2 was greater in cells exposed to 
UV radiation. Phosphorylation of ATF2 by JNK1 was 

) associated with a decreased electrophoretic mobility. 

In a separate experiment, GST fusion proteins 
containing full-length ATF2 (residues 1 to 505) , an NH 2 - 
terminal fragment (residues 1 to 109) , and a COOH- 
terminal fragment (residues 95 to 5 05) were 
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phosphorylated with JNK1 and the sites of phosphorylation 
analyzed by phosphoamino acid analysis. The methods used 
for phosphopeptide mapping and phosphoamino acid analysis 
have been described (Alvarez et al . (1991) J. Biol. Chem. 
5 266:15277). The horizontal dimension of the peptide maps 
was electrophoresis and the vertical dimension was 
chromatography. The NH 2 - terminal sites of phosphorylation 
were identified as Thr 69 and Thr 71 by phosphopeptide 
mapping and mutational analysis. Site-directed 
10 mutagenesis was performed as described above, replacing 
Thr 69 and Thr 71 with Ala. Phosphorylation of mutated ATF2 
was not observed. 

Example 10. Reduced Electrophoretic Mobility of JNK- 
Activated ATF2 

15 CHO cells were maintained in Ham' s F12 medium 

supplemented with 5% bovine serum albumin (Gibco-BRL) . 
Cells were labeled and transfected with JNK1 as described 
above. CHO cells were treated with UV-C (40 J/nr) , IL-la 
(10 ng/ml) (Genzyme) , or fetal bovine serum (20%) (Gibco- 

20 3RD . The cells were incubated for 30 minutes a~ 37oc 
prior to harvesting. The electrophoretic mobility of 
ATF2 after SDS-PAGE was examined by protein immuno-blct 
analysis. A shift in ATF2 electrophoretic mobility was 
observed in cells treated with UV, IL-1, and serum. 

25 These results indicate that JNK1 activation is associated 
with an electrophoretic mobility shift of ATF2 , further 
suggesting that ATF2 is an in vivo substrate for JNK1 . 

Example 11. Increased ATF2 Phosphorylation After 
Activation of JNK 
30 COS-1 cells were transfected without (control) and 

with an ATF2 expression vector (ATF2 ) , as described above 
(Hai et al . (1989) supra). The effect of exposure of the 
cells to 4 0 J/m 2 UV-C was examined. After irradiation, 
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the cells were incubated for 0 or 3 0 minutes (control) or 
0, 15, 30, and 4 5 minutes (ATF2) at 3 7oc and then 
collected. The electrophoretic mobility of ATF2 during 
SDS-PAGE was examined by protein immuno-blot analysis as 
5 described above. The two electrophoretic mobility forms 
of ATF2 were observed in ATF2 - transf ected cells, but not 
in control cells. 

The phosphorylation state of wild-type (Thr 69 ' 71 ) 
ATF2 and mutated (Ala 69 - 71 ) ATF2 was examined in cells 
10 labeled with [ 32 ]P, treated without and with 40 J/m 2 UV-C, 
and then incubated at 37oc for 30 minutes (Hai ec al . 
(198 9) supra) . The ATF2 proteins were isolated by 
immunoprecipitacion and analyzed by SDS-PAGE and 
autoradiography. The phosphoryiated ATF2 proteins were 
15 examined by phosphoamino acid analysis as described 

above. Both forms of ATF2 contained phosphoserine , but 
only wild- type ATF2 contained phosphothreonine . 

Tryptic phosphopeptide mapping was used to compare 
ATF2 phosphoryiated in vitro by JNK1 with ATF2 
20 phosphoryiated in COS-1 cells. A map was also prepared 
with a sample composed of equal amounts of in vivo and in 
vitro phosphoryiated ATF2 (Mix) . Mutation of ATF2 at 
Thr 69 and Thr 71 resulted in the loss of two tryptic 
phosphopeptides in maps of ATF2 isolated from UV- 
25 irradiated cells. These phosphopeptides correspond to 

mono- and bis -phosphoryiated peptides containing Thr 63 and 
Thr 71 . Both of these phosphopeptides were found in maps 
of ATF2 phosphoryiated by JNK1 in vitro. 

Example 12. Inhibition of ATF2 -Stimulated Gene Expression 
30 by Mutation of the Phosphorylation Sites 

Thr 69 and Thr 71 
A fusion protein consisting of ATF2 and the GAL4 
DNA binding domain was expressed in CHO cells as 
described above. The activity of the GAL4-ATF2 fusion 
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protein was measured in co- transf ection assays with the 
reporter plasmid pGSElbLuc (Seth et al. (1992) C. Biol . 
Chem. 267:24796, hereby specifically incorporated by 
reference) . The reporter plasmid contains five GAL4 

5 sites cloned upstream of a minimal promoter element and 
the firefly lucif erase gene. Transf ection efficiency was 
monitored with a control plasmid that expresses 2- 
galactosidase (pCHHO; Pharmacia -LKB Biotechnology). The 
luciferase and fi-galactosidase activity detected in cell 

0 extracts was measured as the mean activity ratio of three 
experiments (Gupta et al . (1993) Proc . Natl. Acad. Sci. 
USA 90:3216, hereby specifically incorporated by 
reference). The results , shown in Table 1, demonstrate 
the importance of phosphorylation at Thr 69 and Thr 71 for 

5 transcriptional activity. 

TABLE 1. INHIBITION OF ATF-2 STIMULATED GENE EXPRESSION 
BY MUTATION OF THE PHOSPHORYLATION SITES THR 69 ' 71 



PROTEIN 


LUCIFERASE ACTIVITY 
(Light Units/OD) 


GAL4 


45 


GAL4-ATF2 (wild type) 


320 , 000 


GAL4-ATF2 (Ala") 


24 , 000 


GAL4-ATF2 (Ala 71 ) 


22 , 000 


GAL4-ATF2 (Ala 53 ' 71 ) 


29, 000 


GAL4-ATF2 (Glu 59 ) 


27, 000 



Example 13 . Effect of Dominant -Negative JNK1 Mutant on 
ATF2 Function 
The luciferase reporter plasmid system was used to 
determine the effect of point mutations at the ATF2 
phosphorylation sites Thr 69 and Thr 71 in serum- treated CHO 
cells transf ected with wild- type (Thr 183 , Tyr 185 ) or mutant 
(Ala 133 , Phe 185 ) JNK1 . Control experiments were done with 
mock- transf ected cells. The CHO cells were serum-starved 



SUBSTITUTE SHEET (RULE 26) 



WO 99/02547 



PCT/US98/14101 



- 41 - 

for 18 hours and then incubated without or with serum for 
4 hours. Expression of wild-type ATF2 caused a small 
increase in serum- stimulated ATF2 transcriptional 
activity. In contrast, mutant JNK1 inhibited both 
5 control and serum- stimulated ATF2 activity. 

Example 14 . Effect of Tumor Suppressor Gene Product Rb 
and Adenovirus Oncoprotein E1A on ATF2 - 
Stimulated Gene Expression 
The effect of expression of the Rb tumor 

1C suppressor gene product and adenovirus oncoprotein E1A on 
ATF2 transcriptional activity were investigated with a 
lucif erase reporter plasmid and GAL4 -ATF2 (residues 1- 
505), as described above. Cells were transfected with 
wild-type (Thr 69 ' 71 ) or mutated (Ala 69 ' 71 ) ATF2 . No effect 

15 of Rb or E1A on luciferase activity was detected in the 
absence of GAL4 -ATF2 . Rb and E1A were found to increase 
ATF2 -stimulated gene expression of both wild-type and 
mutated ATF2 . However, mutated ATF2 caused a lower level 
of reporter gene expression than did wild-type ATF2 . 

2C These results indicate a requirement for ATF2 

phosphorylation (cn Thr 65 and Thr 71 ) plus either Rb or E1A 
for maximal transcriptional activity. 

Example 15 . Substrate Specificity of p38 MAP Kinase 

Substrate phosphorylation by p38 MAP kinase was 

25 examined by incubation of bacterially-expressed p3 8 MAP 

kinase with IkB, cMyc, EGF-R, cytoplasmic phospholipase A 2 
(cPLA 2 ) , c-Jun, and mutated ATF2 (Thr 69 ' 71 ) and ATP[y- 32 P] 
(Raingeaud et al . (1995) J. Biol. Chem 270:7420, herein 
specifically incorporated by reference) . GST-IkB was 

30 provided by Dr D. Baltimore (Massachusetts Institute of 
Technology) . GST-cMyc (Alvarez et al . (1991) J. Biol. 
Chem. 266:15277), GST-EGF-R (residues 647-688) (Koland et 
al. (1990) Biochem. Biophys . Res. Commun. 166:90), and 
GST-c-Jun (Derijard et al . (1994) supra) have been 
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described. The phosphorylation reaction was terminated 
after 3 0 minutes by addition of Laemmli sample buffer. 
The phosphorylated proteins were resolved by SDS-PAGE and 
detected by autoradiography. The rate phosphorylation of 
5 the substrate proteins was quantitated by Phosphorlmager 
(Molecular Dynamics Inc.) analysis. The relative 
phosphorylation of ATF2 , MBP, EGF-R, and IkB was 1.0, 
0.23, 0.04, and 0.001, respectively. 

Example 16. Binding of o3 8 MAP Kinase to ATF2 
10 Cell extracts expressing epitope- tagged JNK1 and 

p3 8 MAP kinase were incubated with a GST fusion protein 
containing the activation domain of ATF2 (residues 1-109) 
immobilized on GSH agarose. The supernatant was removed 
and the agarose was washed extensively. Western blot 
15 analysis of the supernatant and agarose-bound fractions 
was conducted as follows: proteins were fractionated by 
SDS-PAGE, electrophoretically transferred to an 
Immobilon-P membrane, and probed with monoclonal 
antibodies to phosphotyrosine (PY2 0) and the Flag epitope 

2 0 (M2) . I mmuno complexes were detected using enhanced 

chemiluminescence (Amersham International PLC) . Control 
experiments were performed using immobilized GST. 

Example 17. p3 8 MAP Kinase and JNK1 Activation bv Pro- 
Inflammatory Cytokines and Environmental 
25 Stress 

The effect of phorbol ester, EGF, UV radiation, 
osmotic stress, IL-1, tumor necrosis factor (TNF) , and 
LPS on p3 8 MAP kinase and JNK1 activity were measured in 
immunecomplex protein kinase assays using ATP[-y- 32 P] and 

3 0 ATF2 as substrates. TNFa and IL-lc* were from Genzyme 

Corp. Lipolysaccharide (LPS) was isolated from 
lyophilized Salmonella minesota Re5 95 bacteria as 
described (Mathison et a. (1988) J. Clin. Invest. 
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81:1925) . Phorbol myristate acetate was from Sigma. EGF 
was purified from mouse salivary glands (Davis (198 8) J. 
Biol. Chem. 2 63:9462). Kinase assays were performed 
using immunoprecipitates of p3 8 and JNK. The 

5 immunocomplexes were washed twice with kinase buffer 
(described above) , and the assays initiated by the 
addition of 1 fig of ATF2 and 5 0 /xM [y- 32 P]ATP (10 Ci/mmol) 
in a final volume of 25 fil . The reactions were 
terminated after 3 0 minutes at 30oc by addition of 

0 Laemmli sample buffer. The phosphorylation of ATF2 was 
examined after SDS-PAGE by autoradiography, and the rate 
of ATF2 phosphorylation quantitated by Phosphorlmager 
analysis . 

The results are shown in Table 2. Exposure of 
5 HeLa cells to 10 nM phorbol myristate acetate very weakly 
activated p38 and JNK1 . Similarly, treatment with 10 nM 
EGF only weakly activated p3 8 and JNK1 . By contrast, 
treatment with 40 J/m 2 UV-C, 300 mM sorbitol, 10 ng/ml 
interleukin-1, and 10 ng/ml TNFa strongly activated p3 8 
0 and JNK1 activity. The effect of LPS on the activity of 
p38 was examined using CHO cells that express human CD14 . 
Exposure of CHO cells to 10 ng/ml LPS only slightly 
activated p38 and JNK1 activity. 

TABLE 2. p38 AND JNK1 ACTIVATION BY PRO- INFLAMMATORY 
5 CYTOKINES AND ENVIRONMENTAL STRESS. 



Relative Protein Kinase Activity 

JNK p3 8 


Control 


1.0 


1 . 0 


Epidermal Growth Factor (10 nM) 


1.9 


2 . 1 


Phorbol Ester (10 nM) 


2.3 


2 . 9 


Lipopolysaccharide (10 ng/ml) 


3.6 


3 . 7 


Osmotic Shock (300 mM sorbitol) 


18 . 1 


4.2 


Tumor Necrosis Factor (10 ng/ml) 


19.3 


10.3 


Interleukin-1 (10 ng/ml) 


8 . 9 


6.2 


UV (40 J/m 2 ) 


7.4 


17.1 
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Example 18. o38 MAP Kinase Activation bv Dual 
Phosphorylation on Tvr and Thr 
COS-1 cells expressing wild-type (Thr 180 , Tyr 182 ) or 
mutated (Ala 180 , Phe 182 ) p38 MAP kinase were treated without 
5 and with UV-C (40 J/m 2 ) . The cells were harvested 30 
minutes following exposure with or without UV radiation. 
Control experiments were performed using mock- transf ected 
cells. The level of expression of epitope-tagged p38 MAP 
kinase and the state of Tyr phosphorylation of p38 MAP 
0 kinase was examined by Western blot analysis using the M2 
monoclonal antibody and the phosphotyrosine monoclonal 
antibody PY20. Immune complexes were detected by 
enhanced chemiluminescence . 

Wild-type and mutant p38 were expressed at similar 
5 levels. Western blot analysis showed that UV radiation 
caused an increase in the Tyr phosphorylation of p38. 
The increased Tyr phosphorylation was confirmed by 
phosphoamino acid analysis of p3 8 isolated from 
[ 32 P] phosphate-labeled cells. The results also showed 
0 that UV radiation increased Thr phosphorylation of p38. 
The increased phosphorylation on Tyr and Thr was blocked 
by mutated p38. Wild- type and mutated p3 8 were isolated 
from the COS-1 cells by immunoprecipitation. Protein 
kinase activity was measured in the immune complex using 
5 [y- 32 P]ATP and GST-ATF2 as substrates. The phosphorylated 
GST-ATF2 was detected after SDS-PAGE by autoradiography. 
UV radiation resulted in a marked increase in the 
activity of wild- type p3 8, while the mutant p3 8 was found 
to be catalytically inactive. These results show that 
0 p38 is activated by dual phosphorylation within the Thr- 
Gly-Tyr motif. 



SUBSTITUTE SHEET (RULE 26) 



WO 99/02547 



PCT/US98/14101 



Example 19. MAP Kinase Phosphatase Inhibits p38 MAP 
kinase Activation 
The cells were treated without and with 4 0 J/m 2 
UV-C. Control experiments were performed using mock- 
5 transfected cells (control) and cells transfectea with 
the catalytically inactive mutated phosphatase mPACl 
(Cys 257 /Ser) and human MKP1 . The activity of p38 MAP 
kinase was measured with an immunecomplex protein kinase 
assay employing [y- 32 P]ATP and GST-ATF2 as substrates. 
10 The expression of PAC1 or MKP1 was found to inhibit p38 
phosphorylation, demonstrating that p38 can be regulated 
by the dual specificity phosphatases PAC1 and MKP1 . 

Example 20. Subcellular Distribution of p38 MAP Kinase 

Epitope- tagged p3 8 MAP kinase was expressed in COS 
15 cells. The cells were treated without or with 40 J/m 2 UV 
radiation and then incubated for 60 minutes at 37oc. The 
p38 MAP kinase was detected by indirect 

immunofluorescence using the M2 monoclonal antibody. The 
images were acquired by digital imaging microscopy and 

2 0 processed for image restoration. 

Immunocvtochemi s try 

Coverslips (22mm x 22mm No. 1; Gold Seal Cover 
Glass; Becton-Dickinson) were pre- treated by boiling in 
0.1 N HC1 for 10 minutes, rinsed in distilled water, 
25 autoclaved and coated with 0.01% poly-L- lysine (Sigma; 
St. Louis MO). The coverslips were placed at the bottom 
of 35 mm multiwell tissue culture plates (Becton 
Dickinson, UK) . Transfected COS-1 cells were plated 
directly on the coverslips and allowed to adhere 

3 0 overnight in Dulbecco's modified Eagle's medium 

supplemented with 5% fetal calf serum (Gibco-BRL) . 
Twenty- four hours post- transfect ion, the cells were 
rinsed once and incubated at 37oc for 3 0 minutes in 25 mM 
Hepes, pH 7.4, 137 mM NaCl , 6 mM KC1 , 1 mM MgCl 2/ 1 mM 



SUBSTITUTE SHEET (RULE 26) 



WO 99/02547 



PCT/US98/14101 



- 46 - 

CaCl 2/ 10 mM glucose. The cells were rinsed once with 
phosphate-buffered saline and the coverslips removed from 
the tissue culture wells. Cells were fixed in fresh 4% 
paraformaldehyde in phosphate-buffered saline for 15 
5 minutes at 22 °C. The cells were permeabilized with 0.25% 
Triton® X-100 in phosphate-buffered saline for 5 minutes 
and washed three times in DWB solution (150 mM NaCl , 15 
mM Na citrate, pH 7.0, 2% horse serum, 1% (w/v) bovine 
serum albumin, 0.05% Triton® X-100) for 5 minutes. The 
0 primary antibody (M2 anti-FLAG monoclonal antibody, 

Eastman-Kodak Co., New Haven, CT) was diluted 1:250 in 
DWB and applied to the cells in a humidified environment 
at 22 oc for 1 hour. The cells were again washed three 
times as above and fluorescein isothiocyanate-ccnjugated 
goat anti -mouse Ig secondary antibody (Kirkegaard & Perry 
Laboratories Inc. Gaithersburg, MD) was applied at a 
1:250 dilution for 1 hour at 22oc in a humidified 
environment. The cells were then washed three times in 
DWB and then mounted onto slides with Gel -Mount :3iomeda 
Corp. Foster City, CA) for immunofluorescence analysis. 
Control experiments were performed to assess the 
specificity of the observed immunofluorescence. No 
fluorescence was detected when the transfected cells were 
stained in the absence of the primary M2 monoclonal 
antibody, or mock- transfected cells. 
Dicrital Imaging Microscopy and Image Restoration 

Digital images of the fluorescence distribution in 
single cells were obtained using a Nikon 60x Planapo 
objective (numerical aperture =1.4) on a Zeiss IM-35 
microscope equipped for epif luorescence as previously 
described (Carrington et al . (1990) in: Non- invasive 
Techniq ues in Cell Biology, Fosbett 5c Grinstein, eds . , 
Wiley-Liss, NY; pp. 53-72; Fay et al . (1989) J. Microsci. 
153:133-149). Images of various focal planes were 
obtained with a computer controlled focus mechanism and a 
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thermoelectricaily cooled charged- coupled device camera 
(model 220; Photometries Ltd., Tucson, AZ) . The exposure 
of the sample to the excitation source was determined by 
a computer-controlled shutter and wavelength selector 
5 system (MVI, Avon, MA). The charge -coupled device camera 
and microscope functions were controlled by a 
microcomputer, and the data acquired from the camera were 
transferred to a Silicon Graphics model 4D/GTX 
workstation ( Mount ainview, CA) for image processing. 
10 Images were corrected for non-uniformities in sensitivity 
and for the dark current of the charge coupled device 
detector. The calibration of the microscopy blurring was 
determined by measuring the instrument's point spread 
function as a series of optical sections at 0.125^m 
15 intervals of a 0.3 (im diameter f luorescently labeled 
latex bead (Molecular Probes Inc.) . The image 
restoration algorithm used is based upon the theory of 
ill -posed problems and obtains quantitative dye density 
values within the cell that are substantially more 
20 accurate than those in an unprocessed image (Carringtcn 
et al. (1990) supra; Fay et al . (1989) supra). After 
image processing, individual optical sections of cells 
were inspected and analyzed using computer graphics 
software on a Silicon Graphics workstation. p38 MAP 
25 kinase was observed at the cell surface, in the 

cytoplasm, and in the nucleus. After irradiation, an 
increased localization of cytoplasmic p38 to the 
perinuclear region was detected. 

Example 21. Activation of the MKK Signal Transduction 
3 0 Pathway by Osmotic Shock 

CHO cells were co-transfected with the piasmid 
pCMV-Flag- Jnkl and pRSV-Neo (Derijard et al . (1994) 
supra) . A stable cell line expressing epitope- tagged 
Jnkl (Flag; Immunex Corp.) was isolated by selection with 
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Geneticin (Gibco-BRL) . The cells were incubated with 0, 
100, 150, 300, 600, or 1000 mM sorbitol for 1 hour at 
37oc. The cells were collected in lysis buffer [20 mM 
Tris, pH 7.4, 1% TRITON® X-100, 2 mM EDTA, 137 mM NaCl , 
5 25 mM (3 -glycerophosphate, 1 mM orthovanadate , 2 mM 

pyrophosphate, 10% glycerol, 1 mM phenylmethylsuif onyl 
fluoride, 10 pa /ml leupeptin) and a soluble extract was 
obtained by centrif ugat ion at 100,000 g for 30 minutes at 
4oc. The epitcpe-tagged JNK1 was isolated by 
10 immunoprecipitation with the monoclonal antibody M2 
(Immunex Corp.). The immunoprecipitates were washed 
extensively with lysis buffer. Immune complex kinase 
assays were done in 25 /il of 25 mM Hepes, pH 7.4, 2 5 mM 
MgCl 2 , 25 mM /^-glycerophosphate , 2 mM dithiothreitol , 100 
15 jiM orthovanadate, and 50 /xM ATP [y- 32 P] (10 Ci/mmole) with 
2.5 /zg of bacterially expressed c-Jun (residues 1-79) 
fused to glutathione-S-transf erase (GST) as a substrate. 
The phosphorylation of c-Jun was examined after SDS-PAGE 
by autoradiography and Phosphorlmager (Molecular Dynamics 
20 Inc.) analysis. JNK1 activation was observed at all 
concentrations of sorbitol exposure. 

The time course of JNK1 protein kinase activation 
was measured in cells incubated in medium supplemented 
with 300 mM sorbitol as described above. Increased JNK1 
25 activity was observed within 5 minutes of exposure to 
sorbitol, with maximum activity occurring after 15-30 
minutes . 

Mutation of JNK1 at the phosphorylation sites 
Thr 183 and Tyr 185 blocked the activation of JNK1 protein 
3 0 kinase activity by osmotic shock. CHO cells were 

transfected with vector, wild-type JNK1 (Thr 183 , Tyr 185 ), 
and mutated JNK1 (Ala 183 , Phe 185 ) . The cells were incubated 
in medium supplemented without or with 3 00 mM sorbitol 
for 15 minutes before measurement of JNK1 protein kinase 
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activity as described above. JNK1 activation was seen in 
the wild-type but not mutated JNK1 . 

Example 22. Molecular Cloning of MKK7 

RT-PCR was employed to identify a fragment: of a 
novel mammalian MAP kinase kinase. The primers designed 
for the protocol, ATNGCNGTNAARCARATG (SEQ ID NO: 23) and 
ATNC KYT CNGGNG C C AT RT A (SEQ ID NO: 24), were based on the 
sequence of the Drosophila MAP kinase kinase hep (Glise 
et al. (1995) Cell 83:451-461). Murine testis mRNA was 
used as the template. A single product (461 bp) was 
detected following RT-PCR amplification of murine testis 
mRNA. Sequence analysis identified this PCR product as a 
fragment of a novel mammalian MAP kinase kinase. Full- 
length murine cDNA clones were isolated by screening a 
murine testis library (Stratagene Inc.). The cDNA 
clones were examined by sequencing with an Applied 
Biosystems model 3 73A machine. A group of seven clones 
was identified by sequence analysis to contain a single 
long open reading frame that encoded a putative protein 
kinase (Fig. 9 and Fig. 10; SEQ ID NO: 17 and SEQ ID 
NO: 18) . In- frame termination codons were detected in the 
5' and 3' regions cf these clones. This sequence 
includes protein kinase sub-domains I - XI and is related 
to the MAP kinase kinase group. The novel protein kinase 
was designated MKK7 . The sites of activating 
phosphorylation of MAP kinase kinases located in sub- 
domain VIII are conserved in MKK7 . Comparison of MKK7 
with other members of the mammalian MAP kinase kinase 
group demonstrates that MKK7 is related to the JNK 
activator MKK4 . One additional cDNA clone isolated from 
the X phage library differed from the other seven clones. 
This clone contained the same 3' untranslated region and 
coding region of MKK7 , but had a different 5' region that 
lacked an in- frame termination codon. This clone 



SUBSTITUTE SHEET (RULE 26) 



WO 99/02547 



PCT/US98/14101 



- 50 - 

represents an alternatively spliced form of MKK7 (MKK7b ; 
Fig. 11; SEQ ID NO:19). The MKK7b cDNA clone does not 
have an initiation codon in the alternative 5 ' region; 
this cDNA therefore encodes the same MKK7 protein kinase 
5 as the other clones that were isolated. However, if the 
MKK7b cDNA clone is not full-length it is possible that 
additional 5' sequence may include an in-frame initiation 
codon. If true, MKK7b is predicted to fuse the sequence 
M- [?] -SPAPAPSQRAALQLPLANDGGSRSPSSESSPQHPTPPTRPRH- (SEQ ID 

10 NO:33) to the initiating methionine of MKK7 (Fig. 9). 
Although the Drosophila MAP kinase kinase hep shares 
substantial sequence similarity with MKK7 , the sequence 
of the NH2- terminal extension of MKK7b is not conserved 
in the hep protein kinase. Three additional clones 

15 encoded MKK7 splice variants that differ in the 5' and 3' 
regions. These clones (MKK7c (Fig. 13), MKK7d (Fig. 14), 
and MKK7e (Fig. 15)) are full-length because of the 
presence of in- frame termination codons in the 5' and 3' 
regions . 

20 A human cDNA library was screened with a full- 

length mouse MKK7 cDNA probe. A single clone was 
identified and squenced. A partial MKK7 sequence was 
identified (Fig. 12; SEQ ID NO : 25 and SEQ ID NO:2S) that 
is missing the 3' end. The sequence is most homologous 

25 co mouse MKK7c . 

The sequences of MKK7 , MKK7b , hep, and human MKK7 
cDNAs have been deposited in Genbank with accession 
numbers U93030, U93031, U93032, and AF00319 respectively. 

Example 2 3 . Expression of MKK7 
3 0 MKK7 expression was examined by Northern blot 

analysis of mRNA isolated from different tissues. The 
analysis was done with poly A+ mRNA (2 /xg) isolated from 
different tissues and fractionated by denaturing agarose 
gel electrophoresis and transferred to a nylon membrane 



SUBSTITUTE SHEET (RULE 26) 



WO 99/02547 



PCTYUS98/14101 



- 51 - 

(Clontech) . The blot was probed with MKK4 and MKK7 cDNAs 
labeled by random priming with [a- 32 P]dATP (Amersham 
International PLC) . 

MKK7 was found to be widely expressed in murine 
5 tissues. A single MKK7 transcript (approximately 4.0-kb) 
was detected in all of the tissues examined, except for 
testis where two MKK7 transcripts (4.0 kb and 1.5 kb) 
were detected. The highest levels of MKK7 expression 
were in testis. Significant expression of MKK7 was also 
10 observed in heart, brain, lung, liver, and kidney. This 
contrasts with MKK4 expression which was highest in brain 
although significant amounts of expression were observed 
in brain, liver, muscle, heart, and kidney. Although 
MKK4 and MKK7 are co-expressed, the relative abundance of 
15 each MAP kinase kinase is different in each of the 
tissues examined. 

Example 24. Specific Activation of JNK bv MKK7 in vitro 

To examine the specificity of MKK7, in vitro 
protein kinase assays were performed. A bacterial MKK7 

2 0 expression vector was prepared by sub-cloning an MKK7 

cDNA (Eco RI and Pvu II fragment) into the Eco RI and Sma 
I sites of pGEX-5Xl ( Pharmacia -LKB ) . The glutathione-S - 
transferase (GST) fusion protein was purified by affinity 
chromatography (Smith and Johnson (1988) Gene 67:31-40) . 

25 The recombinant proteins GST-ATF2 (Gupta et al . (1995) 
Science 267:389-3 93), GST-cJun (Derijard (1994) supra), 
GST-cMyc (Alvarez et al . (1991) J. Biol. Chem. 256:15277- 
15285), GST-ERK2 (Seth et al . (1992) J. Biol. Chem. 
267:24796-24804), GST-p38, (Raingeaudet al . (1995) J. 

30 Biol. Chem. 270:7420-7426), and GST-JNK1 (Derijard (1994) 
supra) have been described. 

Protein kinase assays were performed in kinase 
buffer (25 mM 4 - (2 -hydroxyethyl ) -1 - - 
piperazineethansulfonic acid (pH 7.4), 25 mM /?- 

35 glycerophosphate, 25 mM MgCl 2/ 2 mM dithiothreitol , 0 . 1 mM 
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orthovanadate) . The assays were initiated by the 
addition of 1 fig of substrate proteins and 50 fxM [y- 
32P]ATP (10 Ci/mmoi) in a final volume of 25 fil . The 
reactions were terminated after 3 0 minutes at 25 3 C by 
5 addition of Laemmli sample buffer. The phosphorylation 
of the substrate proteins was examined after SDS- 
polyacrylamide gel electrophoresis (PAGE) by 
autoradiography . 

Recombinant MAP kinases were incubated with GST 
0 (control) or GST-MKK7 using the substrate ATP[y- 32 P]. 

Recombinant MKK7 purified from bacteria was not observed 
to autophosphoryiate . Incubation of the recombinant MKK7 
with MAP kinases demonstrated that MKK7 phosphorylated 
JNK1, but not p3 8 or ERK2 . MKK7 was phosphorylated by 
5 p3 8 and JNK1 . The significance of the 

retrophosphorylation of the MAP kinase kinase by the MAP 
kinase is unclear, but similar retrophosphorylation has 
been detected in kinase assays using MKK4 (Derijard 
(1995) supra) and the Drosophila JNK activator hep (Sluss 
0 (1996) supra) . 

To test whether the phosphorylation of JNK1 by 
MKK7 caused increased protein kinase activity, 
experiments using ATF2 as the JNK substrate were 
performed. GST-MKK7 was incubated in a protein kinase 
5 assay with recombinant JNK1 . JNK activity was measured 
by including the JNK substrate ATF2 in each assay. ATF2 
was not phosphorylated by MKK7 , but was weakly 
phosphorylated by JNK1 . Incubation of MKK7 with JNK1 
caused phosphorylation of JNK1 and a large increase in 
0 ATF2 phosphorylation. These data indicate that MKK7 
phosphorylates and activates JNK1 . To confirm this 
conclusion, the effect of replacement of the JNK dual 
phosphorylation motif Thr-Pro-Tyr with Ala-Pro-Phe was 
examined. MKK7 did not phosphorylate the mutated JNK1 
(APF) protein. Furthermore, MKK7 did not increase ATF2 



SUBSTITUTE SHEET (RULE 26) 



WO 99/02547 



PCT/US98/14101 



- 53 - 

phosphorylation by the mutated JNK1 protein kinase. 
Thus, MKK7 is a JNK activator in vitro. 

Example 25. Specific Activation of JNK by MKK7 in vivo 
To examine the specificity of MKK7 in vivo, 
5 cotransf ection assays were performed. CHO cells were 
maintained in Dulbecco's modified Eagle's medium 
supplemented with fetal calf serum (5%; Gibco-BRL) . The 
cells were transfected with the lipof ectamine reagent 
according to the manufacturer's recommendations (Gibco- 
0 BRL) (Derijard (1994) supra). Cells were co- transfected 
with vectors encoding epitope- tagged JNK1 together with 
an empty expression vector (control) or an expression 
vector encoding MKK4 or MKK7 . The epitope tag was 
derived from the hemagglutinin protein (HA) of the 
5 influenza virus. JNK1 was isolated by 

immunoprecipitation of cell lysates. The cells were 
solubilized with lysis buffer (20 mM Tris (pH 7.4), 1% 
TRITON X-100®, 10% glycerol, 137 mM NaCl , 2 mM EDTA, 25 
mM ^-glycerophosphate, 1 mM Na crthovanadate , 2 mM 
pyrophosphate, 1 mM PMSF, 10 /xg/ml leupeptin) and 
centrifuged at 100,000 X g for 15 minutes at 4°C. The 
epitope- tagged protein kinases were immunoprecipitated by 
incubation for 3 hours at 4°C with an ant i -HA monoclonal 
antibody bound to protein-G Sepharose (Pharmacia-LKB 
Biotechnology Inc.). The immunoprecipitates were washed 
three times with lysis buffer (Gupta et al . (1995) 
Science 267:389-393). Protein kinase activity was 
measured in the immunecomplex with [y- 32 P]ATP and c-Jun as 
substrates. The product of the phosphorylation reaction 
was visualized after SDS-PAGE by autoradiography. The 
ERK2 and p3 8 MAP kinases were not activated by co- 
expressed MKK7. Control experiments demonstrated that 
the ERK2 and p3 8 MAP kinases were activated by their 
respective cognate MAP kinase kinases, MKK1 and MKK6 . In 
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contrast, MKK7 did activate JNK1. Interestingly, the 
activation of JNK1 by co-expressed MKK7 was greater than 
that caused by the previously described JNK activator 
MKK4. Together, chese data establish that MKK7 can 
5 function as a specific activator of JNK in cultured 
cells . 

Example 26. Activation of the JNK Signal Transduction 
Pathway by MKK7 
The JNK signaling pathway is known to regulate AP- 
0 1 transcriptional activity (Whitmarsh (1996) supra) . To 
test the hypothesis than the expression of MKK7 would 
cause increased AP-1 transcriptional activity, a co- 
transfection assay was employed using a luciferase 
reporter gene that contains three AP-1 sites cloned 
5 upstream of a minimal promoter element (Rincon and 
Flavell (1994) EMBO J. 13:4370-4381). Luciferase 
reporter gene expression was measured in co- transf ect ion 
assays using the 0.5 fig of the reporter plasmid pTRE- 
lucif erase (Rincon (1994) supra) and 0.25 fig of the 0- 
galactosidase expression vector pCHHO ( Pharmacia -LKB) . 
Experiments using GAL 4 fusion proteins were performed 
using 0.25 fig of pGAL4 - ATF2 (residues 1-109), 0.5 fig of 
the reporter plasmid pGSElbLuc, and 0.25 fig of pCHHO 
(Gupta et al. (1995) supra). The effect of pro-ein 
kinases was examined by co- transf ection with 0.3 /ig cf an 
empty expression vector or a protein kinase expression 
vector. The ERK2 , p3 8, JNK1 , MKK1 , MKK3 , MKK4 , and MKK6 
expression vectors have been described. The cells were 
harvested 3 6 hours post - transf ection . The 0- 
galactosidase and luciferase activity in the cell lysates 
was measured as described (Gupta (1995) supra) 
Expression of MKK4 , MKK7 , or JNK1 did not cause marked 
changes in AP-1 reporter gene expression (Fig. ISA) . In 
contrast, co-expression of MKK7 with JNKl caused 
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increased AP-l -dependent reporter gene expression. 
Consistent with the observation that MKK4 causes weaker 
activation of JNK than MKK7, co-expression of MKK4 with 
JNK caused a smaller increase in AP-1 reporter gene 
5 expression (Fig. 16A) . Together, these data demonstrate 
that MKK7 can function as an activator of the JNK signal 
transduction pathway. 

To further examine the effect of MKK7 on 
transcriptional activity, the effect of MKK7 on the 
10 transcription factor ATF2 was investigated. Previous 
studies have demonstrated that ATF2 is a target of the 
JNK signal transduction pathway (van Dam et al . (1995) 
supra; Gupta et al . (1995) supra; Livingstone et al 
(1995) supra) . JNK phosphorylates two sites (Thr-69 and 
15 Thr-71) in the NH 2 - terminal activation domain of ATF2 and 
increases transcriptional activity. A GAL4 fusion 
protein strategy was employed to monitor the 
transcriptional activity of the activation domain of ATF2 
(Gupta (1995) supra) . Measurement of reporter gene 
2 0 expression demonstrated that the coexpression of MKK4 

with JNK1 caused increased transcriptional activity (Fig. 
16B) . A similar level of reporter gene expression was 
caused by expression of MKK7 and a larger increase was 
detected when MKK7 was co-expressed with JNK1 . The more 
25 potent effect of MKK7 , compared with MKK4 , on 

transcriptional activity is consistent with the relative 
effects of MKK7 and MKK4 on JNK activation. To confirm 
that the increased reporter gene expression was mediated 
by ATF2 phosphorylation, the effect of replacement of the 
30 sites of ATF2 phosphorylation (Thr-69 and Thr-71) with 
Ala was examined. The mutated ATF2 protein was not 
regulated by MKK4 , MKK7 , or JNK1 (Fig. 16B) . Together, 
these data demonstrate that MKK7 can regulate a 
physiological target of the JNK signaling pathway. 
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Use 

The MKK polypeptides and polynucleotides of the 
invention are useful for identifying reagents that 
modulate the MKK signal transduction pathways. Reagents 
5 that modulate an MKK signal transduction pathway can be 
identified by their effect on MKK synthesis, MKK 
phosphorylation, or MKK activity. For example, the 
effect of a reagent on MKK activity can be measured by 
the in vitro kinase assays described above. MKK is 
0 incubated without (control) and with a test reagent under 
conditions sufficient to allow the components to react, 
then the effect of the test reagent on kinase activity is 
subsequently measured. Reagents that inhibit an MKK 
signal transduction pathway can be used in the treatment 
5 of MKK-mediated disorders. Reagents that stimulate an 
MKK signal transduction pathway can be used in a number 
of ways, including induction of programmed cell death 
(apoptosis) in tissues. For example, the elimination of 
UV damaged cells can be used to prevent cancer. 

Generally, for identification of a reagent that 
inhibits the MKK signal transduction pathway, a kinase 
assay (see, for example, Example 2) is used. A range of 
reagent concentrations (e.g., 1.0 nM to 100 mM) are added 
to a test system that includes an MKK substrate and a 
radioactive marker such as [y- 22 P]ATP. Appropriate 
substrate molecules include p3 8, JNK1, JNK2 , or ATF2 . 
The incorporation of labelled phosphorus (e.g., [ 32 ] P or 
[ 33 ]P) into the substrate is determined, and the results 
obtained with the test reagent compared to control 
values. Of particular interest are reagents that result 
in inhibition of [ 32 ] P incorporation of about 80% or more. 
Phosphorylation may also be examined using a reagent that 
is phosphorylation-dependent , for example, an antibody. 
Phosphorylation-dependent antibodies may be made using 
MKK7 phosphorylated on the activating sites, Ser 98 and 
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Thr 202 . This may be accomplished by immunizing animals 
with a synthetic peptide (for example, approximately 15 
amino acids in length) corresponding to the MKK7 sequence 
with phosphorylated Ser 198 and Thr 202 . Methods of producing 
5 such antibodies are known in the art. Such antibodies 
are useful for the detection of activated MKK7 in tissues 
and cell extracts (e.g. on Western blots) and may be used 
in a kit. 

Assays that test the effect of a reagent on MKK 
10 synthesis can also be used to identify compounds that 

inhibit MKK signal transduction pathways. The effect of 
the test reagent on MKK expression is measured by, for 
example, Weste rn blot analysis with an antibody specific 
for an MKK. Antibody binding is visualized by 
15 autoradiography or chemiluminescence , and is quantitated. 
The effect of the test reagent on MKK mRNA expression can 
be examined, for example, by Northern blot analysis using 
a polynucleotide probe or by polymerase chain reaction. 

Reagents found to inhibit MKK signal transduction 

2 0 pathways can be used as therapeutic agents for the 

treatment of MKK-mediated disorders. Such reagents are 
also useful in drug design for elucidation of the 
specific molecular features needed to inhibit MKK signal 
transduction pathways. 

25 In addition, the invention provides a method for 

the treatment of MKK-mediated stress -related and 
inflammatory disorders. The method includes 
administration of an effective amount of a therapeutic 
reagent that inhibits MKK function. Suitable reagents 

30 inhibit either MKK activity or expression. The 

concentration of the reagent to be administered is 
determined based on a number of factors, including the 
appropriate dosage, the route of administration, and the 
specific condition being treated. The appropriate dose 

3 5 of a reagent is determined by methods known to those 



SUBSTITUTE SHEET (RULE 26) 



WO 99/02547 



PCT/US98/14101 



- 58 - 

skilled in the art including routine experimentation to 
optimize the dosage as necessary for the individual 
patient and specific MKK-mediated disorder being treated. 
Specific therapeutically effective amounts appropriate 
5 for administration are readily determined by one of 

ordinary skill in the art (see, for example, Remington' s 
Pharmaceutical Sciences . 13th ed. , Gennaro, ed., Mack 
Publishing Company, East on, PA, 19 90) . Dosages may range 
from about 0.1-10 mg/kilo/day. 

10 The invention provides methods for both acute and 

prophylactic treatment of stress-related and inflammatory 
disorders. For example, it is envisioned that ischemic 
heart disease will be treated during episodes of ischemia 
and oxidative stress following reperfusion. In addition, 

15 a patient at risk for ischemia can be treated prior to 
ischemic episodes . 

In another example, a therapeutic agent that 
inhibits MKK function or activity is administered to 
control inflammatory responses by inhibiting the 

2 0 secretion of inflammatory cytokines, including TNF and 

IL-1 . 

Stress-related proliferative disorders can also be 
treated by the method of the invention by administering a 
therapeutic reagent that inhibits MKK function or 
25 activity. Such therapeutic reagents can be used alone or 
in combination with other therapeutic reagents, for 
example, with chemotherapeutic agents in the treatment of 
malignancies. Indeed, the control of stress-activated 
MKK by the therapeutic reagents provided by this 

3 0 invention can modulate symptoms caused by other 

therapeutic strategies that induce stress. 

The therapeutic reagents employed are compounds 
which inhibit MKK function or activity, including 
polynucleotides, polypeptides, and other molecules such 
35 as antisense oligonucleotides and ribozymes, which can be 
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made according to the invention and techniques known to 
the art. Polyclonal or monoclonal antibodies (including 
fragments or derivatives thereof) that bind epitopes of 
MKK also can be employed as therapeutic reagents. 
5 Dominant -negative forms of MKK which effectively displace 
or compete with MKK for substrate binding and/or 
phosphorylation can be used to decrease protein kinase 
activity. Dominant -negative forms can be created by 
mutations within the catalytic domain of the protein 
0 kinases, using methods known in the art, and as described 
above (Example 13) . The catalytic residues are conserved 
in all the MKK isoforms. For example, mutation of Lys 76 
inhibits MKK7 activity. Similarly, mutation of the 
conserved sites of activating phosphorylation (Ser 198 , 
5 Thr 202 ) inhibits MKK7 activity. These kinase- inactive 
forms of MKK7 act as dominant -negative inhibitors. 

In some cases, augmentation of MKK activity is 
desirable, e.g., induction of apoptosis. The methods of 
the invention can be used to identify reagents capable of 
0 increasing MKK function or activity. Alternatively, 

increased activity is achieved by over-expression of MKK. 
When an MKK-mediated disorder is associated with under- 
expression of MKK, or expression of a mutant MKK 
polypeptide, a sense polynucleotide sequence (the DNA 
5 coding strand) or MKK polypeptide can be introduced into 
the cell to enhance normal MKK activity. If necessary, 
these treatments are targeted to specific cells by their 
mode of administration (e.g., by use of cell-type 
specific viral vectors) , or by placing MKK7 nucleic acids 
0 in recombinant constructs with cell -type specific or 
inducible promotors by methods known in the art. For 
example, MKK7 nucleic acid-containing vectors can be 
constructed by recombinant DNA technology methods 
standard in the art. Vectors can be plasmid, viral, or 
others known in the art, used for replication and 
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expression in mammalian cells. Expression of the 
sequence encoding the MKK7 nucleic acid can be by any 
promoter known in the art to act: in mammalian, preferably 
human cells. Such promoters can be inducible or 
5 constitutive. Such promoters include, but are not 
limited to: the SV40 early promoter region (Bernoist 
et al., Nature 290:304, 1981); the promoter contained in 
the 3' long terminal repeat of Rous sarcoma virus 
(Yamamoto et al . , Cell 22:787-797, 1988); the herpes 

10 thymidine kinase promoter (Wagner et al . , Proc . Natl. 
Acad. Sci. USA 78:1441, 1981); or the regulatory 
sequences of the metallothionein gene (Brinster et al . , 
Nature 296:39, 1988). 

The antibodies of the invention can be 

15 administered parenterally by injection or by gradual 
infusion over time. The monoclonal antibodies cf the 
invention can be administered intravenously, 
intraperitoneal ly , intramuscularly, subcutaneous ly, 
intracavity, or transdermally . 

20 Preparations for parenteral administration of a 

polypeptide or an antibody of the invention include 
sterile aqueous or non-aqueous solutions, suspensions, 
and emulsions. Examples of non-aqueous solvents are 
propylene glycol, polyethylene glycol, vegetable oils 

25 such as olive oil, and injectable organic esters such as 
ethyl oleate. Aqueous carriers include water, 
alcoholic/aqueous solutions, emulsions or suspensions, 
including saline and buffered media. Parenteral vehicles 
include sodium chloride solution, Ringer's dextrose, 

30 dextrose and sodium chloride, lactated Ringer's, or fixed 
oils. Intravenous vehicles include fluid and nutrient 
replenishers , electrolyte replenishers (such as those 
based on Ringer's dextrose) and the like. Preservatives 
and other additives can also be present, such as, for 
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example, antimicrobials, antioxidants, chelating agents, 
and inert gases, and the like. 

Polynucleotide sequences, including antisense 
sequences, can be therapeutically administered by various 
5 techniques known to those skilled in the art. Such 
therapy would achieve its therapeutic effect by 
introduction of the MKK polynucleotide into cells of 
mammals having a MKK-mediated disorder. Delivery of MKK 
polynucleotides can be achieved using free polynucleotide 
0 or a recombinant: expression vector such as a chimeric 
virus or a colloidal dispersion system. Especially 
preferred for therapeutic delivery of nucleotide 
sequences is the use of targeted liposomes. 

Targeting of the therapeutic reagent to specific 
5 tissues is desirable to increase the efficiency of 
delivery. The targeting can be achieved by passive 
mechanisms via the route of administration. Active 
targeting to specific tissues can also be employed. The 
use of liposomes, colloidal suspensions, and viral 
vectors allows targeting to specific tissues by changing 
the composition of the formulation containing the 
therapeutic reagent:, for example, by including molecules 
that act as receptors for components of the target 
tissues. Examples include sugars, glycoplipids , 
polynucleotides, or proteins. These molecules can be 
included with the therapeutic reagent. Alternatively, 
these molecules can be included by indirect methods, for 
example, by inclusion of a polynucleotide that encodes 
the molecule, or by use of packaging systems that provide 
targeting molecules. Those skilled in the art will know, 
or will ascertain with the use of the teaching provided 
herein, which molecules and procedures will be useful for 
delivery of the therapeutic reagent to specific tissues. 
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Transgenic animals 

MKK polypeptides can also be expressed in 
transgenic animals. These animals represent a model 
system for the study of disorders that .are caused by or 

5 exacerbated by overexpression or underexpression of MKK, 
and for the development of therapeutic agents that 
modulate the expression or activity of MKK. For example, 
dominant -negative and constitutively activated alleles 
could be expressed in mice to establish physiological 

0 function. 

Transgenic animals can be farm animals (pigs, 
goats, sheep, cows, horses, rabbits, and the like) 
rodents (such as rats, guinea pigs, and mice), non-human 
primates (for example, baboons, monkeys, and 
5 chimpanzees) , and domestic animals (for example, dogs and 
cats) . Transgenic mice are especially preferred. 

Any technique known in the art can be used to 
introduce a MKK transgene into animals to produce the 
founder lines of transgenic animals. Such techniques 
0 include, but are not limited to, pronuclear 

microinjection (U.S. Pat. No. 4,873,191); retrovirus 
mediated gene transfer into germ lines (Van der Putten 
et al., Proc. Natl. Acad. Sci . , USA 82:6148, 1985); gene 
targeting into embryonic stem cells (Thompson ez al . , 
5 Cell 56:313, 1989); and electroporat ion of embryos (Lo, 
Mol. Cell. Biol. 3:1803, 1983). Especially useful are 
the methods described in Yang et al . (Proc. Natl Acac. 
Sci. USA 94:3004-3009, 1997) 

The present invention provides for transgenic 
0 animals that carry the MKK transgene in all their cells, 
as well as animals that carry the transgene in some, but 
not all of their cells. That is, the invention provides 
for mosaic animals. The transgene can be integrated as a 
single transgene or in concatamers, e.g., head-to-head 
tandems or head-to-tail tandems. The transgene can also 
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particular cell type (Lasko et al . , Proc . Natl. Acad. 
Sci. USA 89:6232, 1992). The regulatory sequences 
required for such a cell-type specific activation will 
5 depend upon the particular cell type of interest, and 
will be apparent to those of skill in the art. 

When it is desired that the MKK transgene be 
integrated into the chromosomal site of the endogenous 
MKK gene, gene targeting is preferred. Briefly, when 
10 such a technique is to be used, vectors containing some 
nucleotide sequences homologous to an endogenous MKK gene 
are designed for the purpose of integrating, via 
homologous recombination with chromosomal sequences, into 
and disrupting the function of the nucleotide sequence of 
15 the endogenous gene. The transgene also can be 

selectively introduced into a particular cell type, thus 
inactivating the endogenous MKK gene in only that cell 
type (Gu et al . , Science 265:103, 1984). The regulatory 
sequences required for such a cell -type specific 
2 0 inactivation will depend upon the particular cell type of 
interest, and will be apparent to those of skill in the 
art. These techniques are useful for preparing "knock 
outs'* having no functional MKK gene. 

Once transgenic animals have been generated, the 

2 5 expression of the recombinant MKK gene can be assayed 

utilizing standard techniques. Initial screening may be 
accomplished by Southern blot analysis or PCR techniques 
to determine whether integration of the transgene has 
taken place. The level of mRNA expression of the 

3 0 transgene in the tissues of the transgenic animals may 

also be assessed using techniques which include, but are 
not limited to, Northern blot analysis of tissue samples 
obtained from the animal, in situ hybridization analysis, 
and RT-PCR. Samples of MKK gene -expressing tissue can 
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also be evaluated immunocytochemically using antibodies 
specific for the MKK transgene product. 

For a review of techniques that can be used to 
generate and assess transgenic animals, skilled artisans 
can consult Gordon (Intl. Rev. Cytol . 115:171-229, 1989), 
and may obtain additional guidance from, for example: 
Hogan et al . Manipulating the Mouse Embryo , Cold Spring 
Harbor Press, Cold Spring Harbor, NY, 1986)/, Krimpenfort 
et al. (Bio/Technology 9:86, 1991), Palmiter et al . (Cell 
41:343, 1985), Kraemer et al . ( Genetic Manipulation of 
the Ear ly Mammalian Embrvo . Cold Spring Harbor Press, 
Cold Spring Harbor, NY, 1985), Hammer et al. (Nature 
315:680, 1985), Purcel et al . (Science, 244:1281, 1986), 
Wagner et al . (U.S. Patent No. 5,175,385), and 
Krimpenfort et al . (U.S. Patent No. 5,175,384) (the 
latter two publications are hereby incorporated by 
reference) . 

Other Embodiments 
It is to be understood that while the invention 
has been described in conjunction with the detailed 
description thereof, that the foregoing description is 
intended to illustrate and not limit the scope cf the 
invention, which is defined by the scope of the appended 
claims. Other aspects, advantages, and modif ications are 
within the scope of the following claims. 
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CLAIMS 

1. A substantially pure mammalian mitogen- 
activated protein kinase kinase (MKK7) polypeptide having 
serine, threonine, and tyrosine kinase activity, and 
phosphorylating mitogen-activated protein (MAP) kinase 
JNK, but not p3 8 . 

2. A polypeptide of claim 1 comprising the amino 
acid sequence of SEQ ID NO: 18, SEQ ID NO: 20, SEQ ID 

NO: 26, SEQ ID NO: 28, SEQ ID NO: 30, or SEQ ID NO: 32. 

3 . An isolated polynucleotide sequence encoding a 
polypeptide of claim 1. 

4. An isolated polynucleotide sequence of claim 3 
comprising the sequence of SEQ ID NO: 17, SEQ ID NO: 19, 
SEQ ID NO:25, SEQ ID N0:27, SEQ ID NO:29, or SEQ ID 

NO: 31, or degenerate variants thereof, or a 
polynucleotide sequence fully complementary to the 
sequence of SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 25, SEQ 
ID NO:27, SEQ ID NO:29, or SEQ ID NO:31, or degenerate 
variants thereof. 

5. An isolated polynucleotide sequence of claim 3 
comprising a polynucleotide sequence that hybridizes 
under stringent hybridization conditions to the sequence 
of SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO: 25, SEQ ID 

NO: 27, SEQ ID NO : 2 9 , SEQ ID NO: 31, or a complement 
thereof . 

6. A recombinant expression vector containing a 
polynucleotide sequence of claim 3 . 

7. A recombinant host cell comprising a 
polynucleotide sequence of claim 3 . 
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8. A purified antibody that binds specifically to 
a polypeptide of claim 1. 

9. A purified antibody that binds specifically to 
a polypeptide of claim 2 . 

10. A method of measuring the activity of a 
mitogen-activated protein kinase kinase (MKK7) in a 
biological test sample, said method comprising: 

a) incubating said test sample with an MKK 
substrate for the MKK polypeptide of claim 1 and labeled 
phosphate under conditions sufficient to allow 
phosphorylation of said substrate; and 

b) determining the rate of incorporation of 
labeled phosphate into said substrate, wherein said rate 
of incorporation is a measure of MKK7 activity. 

11. A method of claim 10, wherein said MKK 
substrate is selected from the group consisting of JNK 
MAP kinases, activating transcription factor-2 (ATF2), 
ATFa, cAMP response element binding protein (CRE-BPa) , 
and c-Jun. 

12. A method of claim 10, wherein said biological 
test sample is fluid, cells, or tissue obtained from a 
mammal . 

13 . A method for measuring the synthesis of MKK7 
in a biological test sample, the method comprising the 
steps of: 

a) obtaining a biological sample; 

b) contacting said biological sample with an 
antibody that specifically binds an MKK7 polypeptide of 
claim 1; and 

c) detecting said antibody bound to MKK7 
polypeptide, wherein the level of MKK7 synthesis is 
determined by the amount of bound antibody. 
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14. A method for measuring the level of 
expression of MKK7 in a test sample, the method 
comprising the steps of: 

a) isolating total or polyadenylated RNA from the 
test sample; 

b) incubating the RNA wich a polynucleotide probe 
specific for an MKK7 polynucleotide of claim 3 ; and 

c) determining the amount of said probe hybridized 
to the RNA, wherein the level of expression of MKK7 is 
directly related to the amount of MKK7 probe hybridized 
to the RNA. 

15. A method for identifying a reagent that 
modulates MKK7 activity, said method comprising: 

a) obtaining a test sample containing MKK7 ; 

b) incubating said test sample with an MKK 
substrate for the MKK polypeptide of claim 1, a range of 
reagent concentrations, and labeled phosphate under 
conditions sufficient to allow phosphorylation of said 
subtrate when said reagent is not present; 

c) detecting phosphorylation of said substrate; 

and 

d) comparing the effect of said reagent on MKK7 
activity relative to a control, wherein any variation 
compared to control indicates a reagent able to modulate 
MKK7 substrate phosphorylation. 

16. A method of claim 15, wherein said MKK7 
substrate is one or more of JNK, ATF2 , ATFa, CRE-BPa, and 
c-Jun. 

17. A method of claim 15 wherein said modulation 
is inhibition of MKK7 activity. 

18. A method for identifying a reagent that 
modulates MKK7 synthesis, said method comprising: 

a) providing a sample capable of MKK7 synthesis; 
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b) incubating said sample with a range of reagent 
concentrations under conditions that allow synthesis of 
MKK7 when said reagent is not present; 

c) detecting an MKK7 polypeptide of claim 1; and 

d) comparing the effect of said reagent on MKK7 
synthesis relative to a control, wherein any variation 
compared to control indicates a reagent able to modulate 
MKK7. 

19. A method of claim 18 wherein said modulation 
is inhibition of MKK7 synthesis. 

20. A method for identifying a reagent that 
modulates MKK7 expression, said method comprising: 

a) providing a sample capable of expressing MKK7 ; 

b) incubating said sample with a range of 
concentrations of said reagent under conditions where 
MKK7 is expressed in the absence of said reagent; 

c) isolating total or polyadenylated RNA from the 

sample ; 

d) incubating the RNA with a polynucleotide probe 
specific for a MKK7 nucleic acid of claim 3; and 

e) comparing the effect of said reagent on MKK7 
RNA expression relative to a control, wherein any 
variation compared to control indicates a reagent able to 
modulate MKK7 expression. 

21. A method of treating an MKK7 -mediated 
disorder in a patient, the method comprising 
administering to the patient a therapeutically effective 
amount of a reagent that modulates MKK7 activity. 

22. The method of claim 21, wherein said MKK7- 
mediated disorder is selected from the group consisting 
of ischemic heart disease, kidney failure, oxidative 
liver damage, respiratory distress syndrome, heat and 
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radiation burns, septic shock, rheumatoid arthritis, 
autoimmune disorders, and inflammatory diseases. 

23. A method of treating an MKK7-associated 
disorder in a patient, comprising administering to the 
patient a therapeutically effective amount of an MKK7 
polypeptide . 

24. The method of claim 23, wherein said MKK7- 
associated disorder is ischemic heart disease, kidney 
failure, oxidative liver damage, respiratory distress 
syndrome, heat and radiation burns, septic shock, 
rheumatoid arthritis, autoimmune disorders, or 
inflammatory diseases. 
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5 10 15 20 25 30 35 40 45 50 55 60 
* * * * • * 

TGGCTGGCAA TGGCCTTGCT GACCTCGAGC CGGGCCCACG TGGGGACCT? TGGAGCACAG 
ACCGACCGTT ACCGGAACGA CTGGAGCTCG ZCCCGGG7GC ACCCCTGGAA ACCTCGTGTC 

65 70 75 30 35 90 95 100 105 112 115 120 

CCTACGATCC TGGTGCAAGG CCGGTGGATG CAGAGGCCAG TCCATATAGC ACCCAGGCCT 
GGATGCTAGG ACCACGTTCC GGCCACCTAC GTCTCCGGTC AGGTATATGG TGGGTCCGGA 

125 130 135 140 145 150 155 160 165 173 175 180 
» • * * • # 

GCGAGGAGCG TGGTCCCCAC CCATCCAGCC CATATGTGCA AGTGCCCTTG ACAGAGAGGC 
(3GCICCTCGC ACCAGGGGTG GGTAGGTCGG GTATACACGT TCACGGGAAC TGTCTCTCCG 

185 190 195 200 205 210 215 220 225 230 23* 240 

TGGTCATATC CATGGTGACC ATTTATGGGC CACAACAGGT CCCCATCTSC GCAGTGAACC 
ACCAGTATAG GTACCACTGG TAAATACCCG GTGTTGTCCA GGGGTAGACG CGTCACTTGG 

245 250 255 260 265 270 275 280 285 290 295 300 

* • * * * # 

CTGTGCIGAG CACCTTGCAG ACGTGATCTT GCTTCGTCCT GCAGCACTGT GCGGGGCAGG 
GACACGACTC GTGGAACGTC TGCACTAGAA CGAAGCAGGA GGTCGTGACA CGCCCCGTCC 

305 310 315 320 325 330 335 340 345 350 355 

* * * * * 

AAAATCCAAG AGGAAGAAGG ATCTACGGAT ATCCTGC ATG TCC AAG CCA CCC GCA 
TTTTAGGTTC TCCTTCTTCC TAGATGCCTA TAGGACG TAC AGG TTC GGT GGG CGT 

Met Ser Lys Pro Pro Ala> 

360 365 370 375 380 385 390 395 400 
* * * * » 

CCC AAC CCC ACA CCC CCC CGG AAC CTG GAC TCC CGG ACC TTC ATC ACC 
GGG TTG GGG TGT GGG GGG GCC TTG GAC CTG AGG GCC TGG AAG TAG TGG 
Pro Asn Pro Thr Pro Pro Arg Asn Leu Asp Ser Arg Thr Phe He Thr> 

405 410 415 420 425 430 435 440 445 450 
* * * » # 

ATT GGA GAC AGA AAC TTT GAG GTG GAG GCT GAT GAC TTG GTG ACC ATC 
TAA CCT CTG TCT TTG AAA CTC CAC CTC CGA CTA CTG AAC CAC TGG TAG 
He Gly Asp Arg Asn Phe Glu Val Glu Ala Asp Asp Leu Val Thr Ile> 

455 460 465 470 475 480 485 490 495 

* » * # 

TCA GAA CTG GGC CGT GGA GCC TAT GGG GTG GTA GAG AAG GTG CGG CAC 
ACT CTT GAC CCG GCA CCT CGG ATA CCC CAC CAT CTC TTC CAC GCC GTG 
Ser Glu Leu Gly Arg Gly Ala Tyr Gly Val Val Glu Lys Val Arg His> 

500 505 510 515 520 525 530 535 540 545 
* * * # # 

GCC CAG AGC GGC ACC ATC ATG GCC GTG AAG CGG ATC CGG GCC ACC GTG 
CGG GTC TCG CCG TGG TAG TAC CGG CAC TTC GCC TAG GCC CGG TGG CAC 
Ala Gin Ser Gly Thr He Met Ala Val Lys Arg He Arg Ala Thr Val> 

550 555 560 565 570 575 580 585 590 595 
***** 

AAC TCA CAG GAG CAG AAG CGG CTG CTC ATG GAC CTG GAC ATC AAC ATG 
TTG AGT GTC CTC GTC TTC GCC GAC GAG TAC CTG GAC CTG TAG TTG TAC 
Asn Ser Gin Glu Gin Lys Arg Leu Leu Met Asp Leu Asp lie Asn Met> 

FIG. 4A 
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600 605 610 615 620 525 630 635 £40 

CGC ACG GTC GAG TGT TTC TAC ACT GTC ACC TTC TAC GGG GCA CTA 
GCG TGC CAG CTG ACA AAG ATG TGA CAG TGG AAG ATG CCC CGT GAT AAG 
Arg Thr Val Asd Cys Phe Tyr Thr Val Thr Phe Tyr Gly Ala Leu Phe> 



645 


650 


655 


6 


;60 
* 


665 


670 
* 


575 


680 
* 


685 


690 
• 


AGA GAG 
TCT CTC 
Arg Giu 


GGA 
CCT 

Gly 


Asp 


CTG 

Val 


TGG 
ACC 
Trp 


ATC 
TAG 
He 


TGC 
ACG 
C/s 


ATG 
TAC 
Met 


GAG 
Glu 


CTC 
GAG 
Leu 


ATG 
TAC 
Met 


GAC 
CTC 
Asp 


ACA 
TGT 
Thr 


AGG 
Ser 


TTG 
AAC 
Leu> 


695 


700 




'05 


710 


715 


720 
» 


725 


730 

* 


735 




GAC 
CTG 
Asp 


AAG 
TTC 

Lys 


* 

TTC 
AAG 
Phe 


ATG 
Tyr 


CGG 
GCC 
Arg 


AAG 
TTC 

Lys 


GTG 
CAC 
Val 


CTG 

Leu 


GAT 
CTA 
Asp 


AAA 
TTT 

Lys 


AAC 
TTG 
Asn 


ATG 
TAC 
Met 


ACA 
TGT 


ATT 
TAA 
He 


CCA 
GGT 
Pro 


/—% 

CTC 
Giu> 


740 


745 


750 


"55 


-60 


765 


770 


775 


780 


735 


* 

GAC 
CTG 
Asp 


ATC 
TAG 
lie 


GAA 
Leu 


GGG 
CCC 

Gly 


GAG 
Glu 


> TTTI 

r\ i. * 

TAA 
He 


GCT 
CGA 
Ala 


CAC 
Val 


TCT 
AGA 
Ser 


ATC 
TAG 
He 


GTG 

wit- 

Val 


CGG 
GCC 
Arg 


CGG 
Ala 


CTG 
GAC 
Leu 


Glu 


CAT 
GTA 
His> 


790 


795 


300 


905 


810 
• 


815 


320 

* 


825 


830 
* 


935 


* 

CTG 
GAC 
Leu 


CAC 
GTG 
His 


AGC 
TCG 
Ser 


AAG 
TTC 
Lys 


CTG 
GAC 
Leu 


TCG 
AGC 
Ser 


GTG 
CAC 
Val 


ATC 
TAG 
He 


CAC 
GTG 
His 


AGA 
TCT 
Arg 


GAT 
CTA 
Asp 


GTG 
CAC 
Val 


AAG 
T rT ^C 
Lys 


CCC 
GGG 
Pro 


Ser 


AAT 
TTA 
Asn> 


840 


845 


950 
* 


855 


860 
* 


965 


370 
* 


875 


380 
* 




GTC 
CAG 
Val 


CTT 
GAA 
Leu 


ATC 
He 


TTG 
Asn 


AAG 

Lys 


GAG 
Glu 


GGC 
CCG 
Gly 


CAT 
GTA 
His 


GTG 
CAC 
Val 


AAG 
Lys 


ATG 
TAC 
Met 


TGT 
ACA 

Cys 


GAC 
CTG 
Asp 


TTT 
AAA 
Phe 


GGC 
CCG 

Gly 


ATC 
TAG 
Ile> 


885 


990 


395 


900 


905 


910 
* 


915 


920 


925 


930 
• 


AGT GGC 
TCA CCG 
Ser Gly 


TAC 
ATG 
Tyr 


TTG 
AAC 
Leu 


\J 1 -3 

CAC 
Val 


GAC 
CTG 
Asp 


TCT 
AGA 
Ser 


GTG 
CAC 
Val 


GCC 
CGG 
Ala 


AAG 
Lys 


ACG 
TGC 
Thr 


ATG 
TAC 
Met 


GAT 
CTA 
Asp 


GCC 
CGG 
Ala 


GGC TGC 
CCG ACG 
Gly Cys> 


935 


940 


945 


950 
* 


955 


960 
* 


965 


970 
• 


975 




AAG 
TTC 
Lys 


CCC 
GGG 
Pro 


* 

TAC 
ATG 
Tyr 


ATG 
TAC 
Met 


GCC 
CGG 
Ala 


CCT 
GGA 
Pro 


GAG 
CTC 
Glu 


AGG 
TCC 
Arg 


ATC 
TAG 
lie 


AAC 
TTG 
Asn 


CCA 
GGT 
Pro 


GAG 
CTC 
Glu 


CTG 
GAC 
Leu 


AAC 
TTG 
Asn 


CAG AAG 
Gin Lys> 



980 985 990 995 1000 1005 1010 1015 1020 1025 
• * ♦ * * 

GGC TAC AAT GTC AAG TCC GAC GTC TGG AGC CTG GGC ATC ACC ATG ATT 
CCG ATG TTA CAG TTC AGG CTG CAG ACC TCG GAC CCG TAG TGG TAC TAA 
Gly Tyr Asn Val Lys Ser Asp Val Trp Ser Leu Gly He Thr Met Ile> 

1030 1035 1040 1045 1050 1055 1060 1065 1070 1075 
* * ♦ * * 

GAG ATG GCC ATC CTG CGG TTC CCT TAC GAG TCC TGG GGG ACC CCG TTC 
CTC TAC CGG TAG GAC GCC AAG GGA ATG CTC AGG ACC CCC TGG GGC AAG 
Glu Met Ala lie Leu Arg Phe Pro Tyr Glu Ser Trp Gly Thr Pro Phe> 

1080 1085 1090 1095 1100 1105 1110 1115 1120 

FIG. 4B 
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CAG CAG CTG AAG CAG GTG GTG GAG GAG CCG TCC CCC CAG CTC CCA GCC 
GTC GTC GAC TTC GTC CAC CAC CTC CTC GGC AGG GGG GTC GAG GGT CGG 
Gin Gin Leu Lys Gin Val Val Glu Glu Pro Ser Pro Gin Leu Pro Ala> 

1125 1130 1125 1140 1145 1150 1155 1160 1165 1170 
* • • * • 

GAC CGT TTC TCC CCC GAG TTT GTG GAC TTC ACT GCT CAG TGC CTG AGG 
CTG GCA AAG AGG GGG CTC AAA CAC CTG AAG TGA CGA GTC ACG GAC TCC 
Asp Arg Phe Ser Pro Glu Phe Val Asp Phe Thr Ala Gin Cys Leu Arg> 

1175 1130 11S5 1190 1195 1200 1205 1210 1215 
* * * * 

AAG AAC CCC GCA GAG CGT ATG AGC TAC CTG GAG CTG ATG GAG CAC CCC 
TTC TTG GGG CGT CTC GCA TAC TCG ATG GAC CTC GAC TAC CTC GTG GGG 
Lys Asn Pro Ala Glu Arg Met Ser Tyr Leu Glu Leu Met Glu His Pro> 

1220 1225 1230 1235 1240 1245 1250 1255 1260 1265 
* * * * * 

TTC TTC ACC TTC CAC AAA ACC AAG AAG ACG GAC ATT GCT GCC TTC GTS 
AAG AAG TGG AAC GTG TTT TGG TTC TTC TGC CTG TAA CGA CGG AAG CAC 
Phe Phe Thr Leu His Lys Thr Lys Lys Thr Asp lie Ala Ala Phe Val> 

1270 1275 1280 1285 1290 1295 1300 1305 1310 1215 1320 
* * * * * * 

AAG AAG ATC CTG GGA GAA GAC TCA TAGGGGCTG GGCCTCGGAC CCCACTCCGG 
TTC TTC TAG GAC CCT CTT CTG AGT ATCCCCGAC CCGGAGCCTG GGGTGAGGCC 
Lys Lys lie Leu Gly Glu Asp Ser> (SEQ ID N0:2) 

1375 1280 

ATAAGCTACT 
TATTCGATGA 

1435 1440 

TGGCTCTGTG 
ACCGAGACAC 

1495 1500 

CCAGGCCCTT 
GGTCCGGGAA 

1555 1560 
* 

TGCTGAGATC 
ACGACTCTAG 

1615 1520 

AGCAGGCTGC 
TCGTCCGACG 

1675 1680 
* 

AGTTGTATAT 
TCAACATATA 

1735 1740 
* 

CCACACCTCT 

FIG. 4C 
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1325 1330 1225 1340 1345 1350 1355 1360 1365 1370 
***** 

CCCTCCAGAG CCCCACAGCC CCATCTGCGG GGGCAGTGCT CACCCACACC 
GGGAG G TCTC GGG G T G TCGG GGTAGACGCC CCCGTCACGA GTGGGTGTGG 

1385 1390 1295 1400 1405 1410 1415 1420 1425 1430 
# * * * * 

GCCATCCTGG CCCAGGGCAT CTGGGAGGAA CCZXGGGGGC TGCTCCCACC 
CGGTAGGACC GGGTCCCGTA GACCCTCCTT GGCTCCCCCG ACGAGGGTGG 

1445 1450 1455 1460 1465 1470 1475 1480 1485 1490 
***** 

GCGAGCCATT TGTCCCAAGT GCCAAAGAAG CAGACCATTG GGGCTCCCAG 
CGCTCGGTAA ACAGGGTTCA CGGTTTCTTC GTCTGGTAAC CCCGAGGGTC 

1505 1510 1515 1520 1525 1530 1535 1540 1545 1550 
***** 

CAGCCGGGGT GGTCACGGAG AGGGACGACG AGGATCCTGG GCAGAGGTCG 

1565 1570 1575 1580 1585 1590 1595 1600 1605 1610 
***** 

CTGGACTGAG GGGGCCTGGA TGCCCCCTGT GGATGCTGCT GCCCCTGCAC 
GACCTGACTC CCCCGGACCT ACGGGGGACA CCTACGACGA CGGGGACGTG 

1625 1630 1635 1640 1645 1650 1655 1660 1665 1670 
***** 

CAGTGCCTGG GTGGATGGGC CACCGCCTTG CCCAGCCTGG ATGCCATCCA 
GTCACGGACC CACCTACCCG GTGGCGGAAC GGGTCGGACC TACGGTAGGT 

1685 1690 1695 1700 1705 1710 1715 1720 1725 1730 
***** 

TTTTTTAATC TCTCGACTGA ATGGACTTTG CACACTTTGG CCCAGGGTGG 
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AAAAAATTAG AGAGGTGACT TACCTGAAAC GTGTGAAACC GGGTCCCACC GG7GTGGAGA 
1745 1750 1755 1760 1765 1770 1^75 1780 17S5 1790 i:*95 1300 



ATCCCGGCTT TGGTGCGGGG TACACAAGAG GGGATGAGTT GTGTGAATAC CCCAAGACT^ 
TAGGGCCGAA ACCACGCCGC ATGTGTTCTC CCGTACTCAA CACACT7ATG GGGTTCTGAG 

1805 1810 1315 1320 1325 1330 1525 1340 1345 1350 1355 1350 

* * • • # » 
CCATGAGGGA GATGCCATGA GCGGCGC^G GCCTTCCCCT GGCACIGGCA AACAGGGCCT 
GGTACTCCGT CTACGGTACT CGGCGGGTTC GGGAAGGGGA CCGTGACCGT TTGIl'CCGGA 

1865 1370 1875 1530 1235 1390 1395 1900 1905 1910 1915 1^20 

* * * * # » 

CTGCGGAGCA CACTGGCTCA CCCAGTCC7G CCCGGCACGG TTATCGGTGT CATTCACCTT 
GACGCCTCGT GTGACGGAGT GGGTCAGGAC GGGCGGTGGC AATAGCCACA GTAAGTGGAA 

1925 1920 1925 194C 1945 1950 1955 1950 1955 1970 1975 1930 

* • * ♦ * 
TTCIGT— • TTTAATTTAT CCTCTGTTGA ITTTZTCTTT TGCTTTATGG GZT^GC773 
AGCACAAAAA AAATTAAATA GGAGACAACT AAAAAAGAAA ACGAAATACC CAAACCGAAG 

1985 1990 1995 2000 2005 2010 2015 2020 2025 2030 

* * # * * 

I TTTTCTTGC ATGGTTTGGA GCTGATCGCT TCTCCCCCAC CCCCTAGGGG (SEO ID VO- n 
AAAAAGAACG TACCAAACCT CGACTAGCGA AGAGGGGGTG GGGGATCCCC ' 

FIG. 4D 
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5 in •= 20 25 30 35 40 45 50 55 50 

TAGCTGCAGC ACAGCCTTCC CTAACGTTGC AACTGGGGGA AAAATCACTT TCCAGTCTCT 
ATCGACGTCG TGTCGGAAGG GATTGCAACG TTGACCCCCT TTTTAGTGAA AGGTCAGACA 

65 70 75 30 55 90 35 100 105 110 115 120 

TTTGCAAGGT GTGCATTTCC ATCTTGATTC CCT CAAAG TC CATCTC-CTGC ATCGGTCAAG 
AAACGTTCCA CACGTAAAGG TAGAACTAAG GGACTTTCAG GTAGACGACG ^uCCAGTTC 

125 130 135 140 145 150 155 160 165 170 175 180 

* * ' 

AGAAACTCCA CTTGCATGAA GATTGCACGC CTGCAGCTTG CATCTTTGTT GCAAAACTAG 
TCTTTGAGGT GAACGTACTT CTAACGTGCG GACGTCGAAC GTAGAAACAA CCTTTTGATC 

18S 120 135 200 205 110 215 220 225 230 235 240 

CTACAGAAGA GAAGCAAGGC AAAGTCTTTT GTGCTCCCCT CCCCCATCA^ AGGAAAGGGG 
GATGTCTTCT CTTCGTTCC3 TTTCAGAAAA CACGAGGGGA GGGGGTAGTT TCCTTTCCCC 

245 250 255 260 265 270 275 280 285 
. • • * 

AAA ATG TCT CAG TCG AAA GGC AAG AAG CGA AAC CCT GGC CTT AAA ATT 
TTT ?AC AGA GTC AGC TTT CCG TTC TTC GCT TTG GGA CCG GAA ITT TAA 
Mec Ser Gin Ser Lys Gly Lys Lys Arg Asn Pro Gly Leu Lys Ile> 

290 295 300 305 310 315 320 325 330 335 

CCA AAA GAA GCA TTT GAA CAA CCT CAG ACC AGT TCC ACA CCA CCT AGA 
GGT TTT CTT CGT AAA CTT GTT GGA GTC TGG TCA AGG TGT GGT GxjA TCT 
Pro Lys Glu Ala Phe Glu Gin Pro Gin Thr Ser Ser Thr Pro Pro Arg> 

340 345 350 355 360 365 370 375 380 

GAT TTA GAC TCC AAG GCT TGC ATT TCT ATT GGA AAT CAG AAC TTT GAG 
CTA AAT CTG AGG TTC CGA ACG TAA AGA TAA CCT Z^A. GTC TTG «nA CTC 
Asp Leu Asp Ser Lys Ala Cys He Ser lie Gly Asn Gin Asn Phe Glu> 



385 390 395 



400 405 410 415 420 425 430 

^ • * 



GTG AAG GCA GAT GAC CTG GAG CCT ATA ATG GAA CTG GGA CGA GGT GCG 
CAC TTC CGT CTA CTG GAC CTC GGA TAT TAC CTT GAC CCT GCT CCA CGC 
Val Lys Ala Asp Asp Leu Glu Pro He Met Glu Leu Gly Arg Gly Ala> 

435 440 445 450 455 460 465 470 475 480 

TAC GGG GTG GTG GAG AAG ATG CGG CAC GTG CCC AGC GGG CAG ATC ATG 
ATG CCC CAC CAC CTC TTC TAC GCC GTG CAC GGG TCG CCC GTC .AG TAC 
Tyr Gly Val Val Glu Lys Met Arg His Val Pro Ser Gly Gin He Meo 



485 490 495 



500 505 510 515 520 525 



GCA GTG AAG CGG ATC CGA GCC ACA GTA AAT AGC CAG GAA CAG AAA CGG 
CGT CAC TTC GCC TAG GCT CGG TGT CAT TTA TCG GTC CTT GTC aw> 
Ala Val Lys Arg He Arg Ala Thr Val Asn Ser Gin Glu Gin Lys Arg> 

530 535 540 545 550 555 560 565 570 575 
• * » * 

CTA CTG ATG GAT TTG GAT ATT TCC ATG AGG ACG GTG GAC TGT T?S 
GAT GAC TAC CTA AAC CTA TAA AGG TAC TCC TGC CAC CTG ACA GGT AAG 

FIG. 5A 
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Leu Leu Met Asp Leu Asp lie Ser Met Arg Thr Val Asp Cys Pro Phe> 

580 535 590 595 500 505 610 615 £20 
* » # » » 

ACT GTC ACC TTT TAT GGC GCA CTG ITT CGG GAG GGT GAT GTG TGG ATC 
TGA CAG TGG AAA ATA CCG CGT GAC AAA GCC CTC CCA CTA CAC ACC TAG 
Thr Val Thr Phe T/r Gly Ala Leu Phe Arg Glu Gly Asp Val Trp lle> 

625 530 535 640 645 650 555 660 665 570 
* » * » + 

TGC ATG GAG CTC ATG GAT ACA TCA CTA GAT AAA TTC TAC AAA CAA GTT 
ACG TAC CTC GAG TAC CTA TGT AGT GAT CTA TTT AAG ATG TTT GTT CAA 
Cys Met Glu Leu Met Asp Thr Ser Leu Asp Lys Phe Tyr Lys Gin Val> 

675 530 535 690 695 700 705 710 715 720 
» * * * * 

ATT GAT AAA GGC CAG ACA ATT CCA GAG GAC ATC TTA GGG AAA ATA GCA 
TAA CTA TTT CCG GTC TGT TAA GGT CTC CTG TAG AAT CCC TTT TAT CGT 
He Asp Lys Gly Gin Thr lie Pro Glu Asp lie Leu Gly Lys lie Ala> 

725 730 ^35 740 745 750 755 760 '55 
* * * * 

GTT TCT ATT GTA AAA GCA TTA GAA CAT TTA CAT AGT AAG CTG TCT GTC 
CAA AGA TAA CAT TTT CGT AAT CTT GTA AAT GTA TCA TTC GAC AGA CAG 
Val Ser He Val Lys Ala Leu Glu His Leu His Ser Lys Leu Ser Val> 

770 775 780 785 790 795 300 305 810 815 



* 






* 








* 


* 






* 




ATT 


CAC 


AGA 


GAC 


GTC 


AAG 


CCT 


TCT AAT GTA 


CTC 


ATC 


AAT 


GCT CTC 


GGT 


TAA 


GTG 


TCT 


CTG 


CAG 


TTC 


GGA 


AGA TTA CAT 


GAG 


TAG 


TTA 


CGA GAG 


CCA 


He 


His 


Arg Asp 


Val 


Lys 


Pro 


Ser Asn Val 


Leu 


He 


Asn Ala Leu Gly> 


820 
• 


325 


330 
* 


335 


840 845 
• 


850 
• 


355 


360 
* 




CAA 


GTG 


AAG 


A1U 




GAT 


TTT 


GGA ATC AGT 


GGC 


TAC 


TTG 


GTG GAC 


TCT 


GTT 


v-AC 




TAC 




CTA 


AAA 


CCT TAG TCA 


CCG 


ATG 


AAC 


CAC CTG 


AGA 


Gin 


Val 


Lys 


Met 


w s 


Asp 


Phe 


Gly He Ser 


Gly 


Tyr 


Leu 


Val Asp 


Ser> 


55 


370 
* 


875 


380 
* 


335 


890 395 
• 


900 
* 


905 910 
* 


GTT 


GCT 


AAA 


ACA 


ATT 


GAT 


GCA 


GGT TGC AAA 


CCA 


TAC 


ATG 


GCC CCT 


GAA 


CAA 


CGA 


TTT 


TGT 


TAA 


CTA 


CGT 


CCA ACG TTT 


GGT 


ATG 


TAC 


CGG GGA 


CTT 


Val 


Ala 


Lys 


Thr 


He 


Asp 


Ala 


Gly Cys Lys 


Pro 


Tyr 


Met 


Ala Pro 


Glu> 


915 


920 
* 


925 


930 
* 


935 940 
* 


945 


950 


955 


960 
* 


AGA 


ATA 


AAC 


CCA 


GAG 


CTC 


AAC 


CAG AAG GGA 


TAC 


AGT 


GTG 


AAG TCT 


GAC 


TCT 


TAT 


TTG 


GGT 


CTC 


GAG 


TTG 


GTC TTC CCT 


ATG 


TCA 


CAC 


TTC AGA 


CTG 


Arg 


He Asn 


Pro 


Glu 


Leu 


Asn Gin Lys Gly 


Tyr 


Ser 


Val 


Lys Ser 


Asp> 



965 970 975 980 985 990 995 1000 1005 
* * * * 

ATT TGG AGT CTG GGC ATC ACG ATG ATT GAG TTG GCC ATC CTT CGA TTT 
TAA ACC TCA GAC CCG TAG TGC TAC TAA CTC AAC CGG TAG GAA GCT AAA 
He Trp Ser Leu Gly He Thr Met He Glu Leu Ala He Leu Arg Phe> 

1010 1015 1020 1025 1030 1035 1040 1045 1050 1055 
* « * * * 

CCC TAT GAT TCA TGG GGA ACT CCA TTT CAG CAG CTC AAA CAG GTG GTA 
GGG ATA CTA AGT ACC CCT TGA GGT AAA GTC GTC GAG TTT GTC CAC CAT 
Pro Tyr Asp Ser Trp Gly Thr Pro Phe Gin Gin Leu Lys Gin Val Val> 

FIG. 5B 
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1060 1065 1070 1075 1080 1085 1090 1095 1100 



i. * A 



GAG GAG CCA TCG CCA CAA CTC CCA GCA GAC AAG TTC TCT GCA GAG 
CTC CTC GGT AGC GGT GTT GAG GGT CGT CTG TTC AAG AGA CGT CTC AAA 
Glu Glu Pro Ser Pro Gin Leu Pro Ala Asp Lys Phe Ser Ala Glu Phe> 

1105 1110 1115 1120 1125 1130 1125 1140 1145 1150 
• * * * * 

GTT GAC TTT ACC TCA CAG TGC TTA AAG AAG AAT TCC AAA GAA CGG CCT 
CAA CTG AAA TGG AGT GTC ACG AAT TTC TTC TTA AGG TTT CTT GCC GGA 
Val Asp Phe Thr Ser Gin Cys Leu Lys Lys Asn Ser Lys Glu Arg Pro 

1155 1160 1165 1170 1175 1180 1185 1190 1195 1200 

* * * * * 

ACA TAC CCA GAG CTA ATG CAA CAT CCA TTT TTC ACC CTA CAT GAA TCC 
TGT ATG GGT CTC GAT TAC GTT GTA GGT AAA AAG TGG GAT GTA CTT AGG 
Thr Tyr Pro Glu Leu Met Gin His Pro Phe Phe Thr Leu His Glu Ser> 

1205 1210 1215 1220 1225 1220 1235 1240 1245 1250 
« * * * * 

AAA GGA ACA GAT GTG GCA TCT TTT GTA AAA CTG ATT CTT GGA GAC T AAAA 
TTT CCT TGT CTA CAC CGT AGA AAA CAT TTT GAC TAA GAA CCT CTG ATTTT 
Lys Gly Thr Asp Val Ala Ser Phe Val Lys Leu He Leu Gly Asp> (seq ID NO: 4) 

1255 1260 1265 1270 1275 1280 1285 1290 1295 1300 1205 1310 

* ♦ * * * • 

AGCAGTGGAC TTAATCGGTT GACCCTACTG TGGATTGGTG GGTTTCGGGG TGAAGCAAGT 
TCGT CA CCTG AATTAGCCAA CTGGGATGAC ACCTAACCAC CCAAAGCCCC ACTTCGTTCA 

1315 1220 1225 1230 1335 1340 1345 1350 1355 1360 1265 1370 

* * * ♦ * * 

TCACTACAGC ATCAATAGAA AGTCATCTTT GAGATAATTT AACCCTGCCT CTCAGAGGGT 
AGTGATGTCG TAGTTATCTT TCAGTAGAAA CTCTATTAAA TTGGGACGGA GAGTCTCCCA 

1375 1380 1385 1290 1295 1400 1405 1410 1415 1420 1425 1420 

* # * * * • 

TTTCTCTCCC AATTTTCTTT TTACTCCCCC TCTTAAGGGG GCCTTGGAAT CTATAGTATA 
AAAGAGAGGG TTAAAAGAAA AATGAGGGGG AGAATTCCCC CGGAACCTTA GATATCATAT 

1435 1440 1445 1450 1455 1460 1465 1470 1475 1480 1485 1490 



* * 



* 



GAATGAACTG TCTAGATGGA TGAATTATGA TAAAGGCTTA GGACTTCAAA AGGTGATTAA 
CTTACTTGAC AGATCTACCT ACTTAATACT ATTTCCGAAT CCTGAAGTTT TCCACTAATT 

1495 1500 1505 1510 1515 1520 1525 1530 1535 1540 1545 1550 

• * • * * * 

ATATTTAATG ATGTGTCATA TGAGTCCTCA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 
TATAAATTAC TACACAGTAT ACTCAGGAGT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT 

1555 1560 1565 1570 1575 1580 1585 1590 1595 1600 

♦ * * * * 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AA (SEQ ID N0:3) 
rTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TT 

FIG. 5C 
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5 10 15 20 25 30 35 40 45 50 55 

* • * » * 

CTAGGGTCCC ZZGCGCCAGG CCACCCGGCC GTCAGCAGC ATG CAG GGT AAA CGC AAA 
GATCCCAGGG GCCGCGGTCC GGTGGGCCGG CAGTCGTCG TAC GTC CCA TTT GCG TTT 

Met Gin Gly Lys Arg Lys> 

60 65 70 "5 80 35 90 95 100 105 

* * * * * 

GCA CTG AAG TTG AAT TTT GCA AAT CCA CCT TTC AAA TCT ACA GCA AGG 
CGT GAC TTC AAC TTA AAA CCT TTA GGT GGA AAG TTT AGA TCT CGT TCC 
Ala Leu Lys Leu Asn Phe Aia Asn Pro Pro Phe Lys Ser Thr Ala Arg> 

110 115 120 125 130 135 140 145 150 
* * * * * 

TTT ACT CTG AAT CCC AAT CCT ACA GGA GTT CAA AAC CCA CAC ATA GAG 
AAA TGA GAC TTA GGG TTA GGA TGT CCT CAA GTT TTG GGT GTG TAT CTC 

Phe Thr Leu Asn Pro Asn Pro Thr Gly Val Gin Asn Pro His lie Glu> 

155 160 155 170 175 180 185 190 195 200 

* * * * * 

AGA CTG AGA ACA CAC AGC ATT GAG TCA TCA GGA AAA CTG AAG ATC TCC 

TCT GAC TCT TGT GTG TCG TAA CTC AGT AGT CCT TTT GAC TTC TAG AGG 

Arg Leu Arg Thr His Ser lie Glu Ser Ser Gly Lys Leu Lys lie Ser> 

205 210 215 220 225 230 235 240 245 

* * * * 

CCT GAA CAA CAC TGG GAT TTC ACT GCA GAG GAC TTG AAA GAC CTT GGA 
GGA CTT GTT GTG ACC CTA AAG TGA CGT CTC CTG AAC TTT CTG GAA CCT 
Pro Glu Gin His Trp Asp Phe Thr Ala Glu Asp Leu Lys Asp Leu Gly> 

250 255 260 265 270 275 280 285 290 295 
* * * * * 

GAA ATT GGA CCA GGA GOT TAT GGT TCT GTC AAC AAA ATG GTC CAC AAA 
CTT TAA CCT GCT CCT CGA ATA CCA AGA CAG TTG TTT TAC CAG GTG TTT 

Glu lie Gly Arg Giy Ala Tyr Gly Ser Val Asn Lys Met Vai His Lys> 

300 305 310 315 320 325 330 335 340 345 
* * * • » 

CCA AGT GGG CAA ATA ATG GCA GTT AAA AGA ATT CGG TCA ACA GTG GAT 
GGT TCA CCC GTT TAT TAC CGT CAA TTT TCT TAA GCC AGT TGT CAC CTA 
Pro Ser Gly Gin lie Met Ala Val Lys Arg lie Arg Ser Thr Val Asp> 

350 355 360 365 370 375 380 385 390 
* * * * * 

GAA AAA GAA CAA AAA CAA CTT CTT ATG GAT TTG GAT GTA GTA ATG CGG 
CTT TTT CTT GTT TTT GTT GAA GAA TAC CTA AAC CTA CAT CAT TAC GCC 
Glu Lys Glu Gin Lys Gin Leu Leu Met Asp Leu Asp Val Val Met Arg> 

395 400 405 410 415 420 425 430 435 440 
* * * # * 

AGT AGT GAT TGC CCA TAC ATT GTT CAG TTT TAT GGT GCA CTC TTC AGA 
TCA TCA CTA ACG GGT ATG TAA CAA GTC AAA ATA CCA CGT GAG AAG TCT 
Ser Ser Asp Cys Pro Tyr lie Val Gin Phe Tyr Gly Ala Leu Phe Arg> 

445 450 455 460 465 470 475 480 485 
* * # # 

GAG GGT GAC TGT TGG ATC TGT ATG GAA CTC ATG TCT ACC TCG TTT GAT 
CTC CCA CTG ACA ACC TAG ACA TAC CTT GAG TAC AGA TGG AGC AAA CTA 
Glu Gly Asp Cys Trp lie Cys Met Glu Leu Met Ser Thr Ser Phe Asp> 
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490 495 300 505 510 515 520 525 530 535 
* * * * • 

AAG TTT TAC AAA TAT GTA TAT AGT GTA TTA GAT GAT GTT ATT CCA GAA 
TTC AAA ATG TTT ATA CAT ATA TCA CAT AAT CTA CTA CAA TAA GGT CTT 
Lys Fhe Tyr Lys Tyr Val T/r Ser Vai Leu Asp Asp Val lie Pro Glu> 



540 545 550 555 550 555 570 575 590 



:35 




LS> 

590 595 500 605 510 515 620 625 530 

TTA GAA ^ C ^ ATvr ATT C«C AGA GAT ATC AAA CCT ~CC AAT 

AAT TTT CTT TTG AAC TTT TAA TAA GTG TCT CTA TAG TTT GGA AGG TTA 
Leu Lys Glu Asn Leu Lys He He His Arg Asp lie Lys Pro Ser Asn> 



635 640 545 550 555 £50 565 670 



580 



ATT CTT CTG GAC AGA AGT GGA AAT ATT AAG CTC TGT GAC T^ GGC A~C 
TAA GAA GAC CTG TCT TCA CCT TTA TAA TTC GAG ACA CTG AAG CCG TAG 
He Leu Leu Asp Arg Ser Gly Asn He Lys Leu C/s Asp Fhe Gly He> 

685 690 595 700 705 710 715 720 725 
* * # * 

AGT GGA CAG CTT GTG GAC TCT ATT GCC AAG ACA AGA GAT GCT GGC 
TCA CCT GTC GAA CAC CTG AGA TAA CGG TTC TGT TCT CTA CGA CCG ACA 
Ser Gly Gin Leu Val Asp Ser He Ala Lys Thr Arg Asp Ala Gly Cys> 

730 735 740 745 750 755 760 765 770 775 
* * * # • 

AGG CCA TAC ATG GCA CCT GAA AGA ATA GAC CCA AGC GCA TCA CGA CAA 
TCC GGT ATG TAC CGT GGA CTT TCT TAT CTG GGT TCG CGT AGT GCT GTT 
Arg Pro Tyr Met: Ala Pro Glu Arg He Asp Pro Ser Ala Ser Arg Gln> 

780 785 790 795 800 305 310 815 320 325 
* * * » ♦ 

GGA TAT GAT GTC CGC TCT GAT GTC TCG AGT TTG GGG k7C ACA TTG TAT 
CCT ATA CTA CAG GCG AGA CTA CAG ACC TCA AAC CCC TAG TGT AAC ATA 
Gly T/r Asp Val Arg Ser Asp Val Trp Ser Leu Gly He Thr Leu Tyr> 

830 835 840 845 850 355 860 865 370 



* 



GAG TTG GCC ACA GGC CGA TTT CCT TAT CCA AAG TGG AAT AGT GTA TTT 
CTC AAC CGG TGT CCG GCT AAA GGA ATA GGT TTC ACC TTA TCA CAT AAA 
Glu Leu Ala Thr Gly Arg Phe Pro T/r Pro Lys Trp Asn Ser Val Phe> 

875 880 385 890 895 900 905 910 915 920 



GAT CAA CTA ACA CAA GTC GTG AAA GGA GAT CCT CCG CAG CTG AGT AAT 
CTA GTT GAT TGT GTT CAG CAC TTT CCT CTA GGA GGC GTC GAC TCA TTA 
Asp Gin Leu Thr Gin Val Val Lys Gly Asp Pro Pro Gin Leu Ser Asn> 

925 930 935 940 945 950 955 960 965 
* * * * 

TCT GAG GAA AGG GAA TTC TCC CCG AGT TTC ATC AAC TTT GTC AAC TTG 
AGA CTC CTT TCC CTT AAG AGG GGC TCA AAG TAG TTG AAA CAG TTG AAC 
Ser Glu Glu Arg Glu Phe Ser Pro Ser Phe He Asn Phe Val Asn Leu> 

970 975 980 985 990 995 1000 1005 1010 1015 
* * * * • 
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TGC CTT ACG AAG GAT GAA TCC AAA AGG CCA AAG TAT AAA GAG CTT CC 
ACG GAA TGC TTC CTA CTT AGG TTT TCC GGT TTC ATA TTT CTC GAA GAC 
Cys Leu Thr Lys Asp GIu Ser Lys Arg Pro Lys Tyr Lys Gia Leu Leu> 

1020 1025 1330 1035 1040 1045 1050 1055 1C60 1065 

• • > 

AAA CAT CCC TTT ATT TTG ATG TAT GAA GAA CGT GCC GTT GAG GTC GCA 
TTT GTA GGG AAA TAA AAC TAC ATA CTT CTT GCA CGG CAA CTC CAG CGT 
Lys His Pro Phe lie Leu Met Tyr Glu Glu Arg Ala Val GIu Val Ala> 

1070 1075 1Q80 1085 1090 1095 1100 1105 mo 

TGC TAT GTT TGT AAA ATC CTG GAT C^A ATG CCA GCT ACT CCC AGC 
ACG ATA CAA ACA TTT TAG GAC CTA GTT TAC GGT CGA TGA GGG TCG AGA 
Cys Tyr Val Cys Lys He Leu Asp Gin Met Pro Ala Thr Pro Ser Ser> 

1115 1120 1125 1130 1135 1140 1145 1150 1155 1150 • 1 55 " 7 n 

£SS 2?2 "iT S?S SiT ~ A ~A7ccrrac tacatcagac tctagaaaaa agggctgaga 

i^v. w.o -_ .rt AC TATAGCRACG ATGTAGTCTG ACATCTTTTT ~- — CGACTr— ' 
Fro Met Tyr Val Asp> 

1175 1130 1135 1190 1195 1200 1205 1210 1215 1220 1225 ^230 

GGAAGCAAGA CG TAAAGAA T TTTCATCCCG TATCACAGTG TTTTTATTGC TCGCCCAGAC 
CCTTC^TTCT GCATTTCTTA AAAGTAGGGC ATAGTGTCAC AAAAATAACG AGCGGGTCTG 

1235 1240 1245 1250 1255 1260 1265 1270 1275 1280 "285 1290 

• • * • • 

ACCATGTGCA ATAAGATTGG TGTTCGTTTC CATCATGTCT GTATACTCCT GTCACCTAGA 
TGGTACACGT TATTCTAACC ACAAGCAAAG GTAGTACAGA CATATGAGGA CAGTGGATCT 

1295 1300 1305 1310 1315 1320 1325 1330 1335 1340 "345 "SO 

* • . 

ACGTCCATCC TTGTAATACC TGATTGATCA CACAGTGTTA GTGCTGGTCA GAGAGACCTC 
TGCACGTAGG AACATTATGG ACTAACTAGT GTGTCACAAT CACGACCAGT CTCTCTGGAG 

1355 1350 1355 1370 1375 1330 1385 1390 1395 1400 "405 ' 410 

* * 

^EltP^ i-i-iwiuATGA ACATATTCAT GAAATGTGGA AGTCAGTACG ATCAAGTTGT 
.AGGACGAGA AAACACTACT TGTATAAGTA CTTTACACCT TCAGTCATGC TAGTTCAACA 

1415 1420 1425 1430 143S 1440 1445 1450 1455 1460 1465 1470 

**••*. 

TGACTGTGAT TAGATCACAT CTTAAATTCA TTTCTAGACT CAAAACCTGG AGATGCAGCT 
ACTGACaCTA ATCTAGTGTA GAATTTAAGT AAAGATCTGA GTTTTGGACC TCTACGTCGA 

1475 1480 1485 1490 1495 1500 1505 1510 1515 1520 1525 1530 

* * • * . " • 

ACTGGAATGG TGTTTTGTCA GACTTCCAAA TCCTGGAAGG ACACAGTGAT GAATGTACTA 
TGACCTTACC ACAAAACAGT CTGAAGGTTT AGGACCTTCC TGTGTCACTA CTTACATGAT 

1535 1540 1545 1550 1555 1560 1S65 1570 1575 1580 1585 1590 

****** 

TATCT CAACA TAGAAACTCG GGCTTGAGTG AGAAGAGCTT GCACAGCCAA CGAGACACAT 
ATAGACTTCT ATCTTTGAGC CCGAACTCAC TCTTCTCGAA CGTGTCGGTT GCTCTGTGTA 

1595 1600 1605 1610 1615 1620 1625 1630 1635 1640 1645 1550 
**•».# 
JSSS^ ^y? 30 ^ CAAAGGAGGA ATTTACTTTC TTCACCAAGT GCAATAGATT 
ACGoAAGACC TCGACCCTCT GTTTCCTCCT TAAATGAAAG AAGTGGTTCA CGTTATCTAA 

FIG. 6C 
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1655 1650 1565 1670 1575 1680 1685 1590 1695 1700 1735 1710 
» • • * * • 

ACTGATGTGA T AT TCT GT T G CTTTACAGTT ACAGTTGATG TTTGGGGATC GATGTGCTCA 
TGACTACACT ATAAGACAAC GAAATGTCAA TGTCAACTAC AAACCCCTAG CTACACGAGT 

1715 1720 1725 1730 1735 1740 1745 1750 1755 1760 1755 1770 

* * ♦ 9 * * 

GCCAAATTTC CTGTTTGAAA 7ATCATGTTA AATTAGAATG AATTTATCTT TACCAAAAAC 
CGGTTTAAAG GACAAACTTT ATAGTACAAT T7AATCT7AC TTAAATAGAA ATGGTTTTTG 

1775 1780 1785 1790 1795 1300 1305 1310 1315 1320 1H25 1830 

* • * * * * 

CATGTTGCGT TCAAAGAGGT GAACATTAAA ATATAGAGAC AGGACAGAAT GTGTTCTTTT 
GTACAACGCA AG TTTCT CCA CTT3TAAT7T TATATCTCTG TCCTGTCTTA CACAAGAAAA 

1835 1840 1345 1350 1355 1860 1365 1370 1875 1880 1835 1890 

* * * • * » 

CTCCTCTACC AGTCCTATTT TTCAATGGGA AGACTCAGGA GTCTGCCACT TC TCAAAGAA 
GAGGAGATGG TCAGGATAAA AAGTTACCCT TCTGAGTCGT CAGACGGTGA ACAGTTTCTT 

1895 1900 1905 1910 1215 1920 1925 1930 1935 1940 1945 1950 

* • * * * * 

GGTGCTGATC CTAAGAA7TT TTCATTCTCA GAATTCGGTG TGCTGCCAAC TTGATGTTCC 
CCACGACTAG GATTCTTAAA AAGTAAGAGT CTTAAGCCAC ACGACGGTTG AACTACAAGG 

1955 1960 1965 1970 1975 1980 1985 1290 1995 2000 2005 2010 

* • * * * * 

ACCTGCCACA AACCACCAGG ACTGAAAGAA GAAAACAGTA CAGAAGGCAA AGTT TACAGA 
TGGACGGTGT TTGGTCGTCC TGA CTTTCTT CTTTTOTCAT GTCTTCCGTT TCAAATGTCT 

2015 2020 2025 2030 2035 2040 2045 2050 2055 2060 2065 2070 

* * * * * * 

TGTTTTTA AT TCTAGTATTT TATCTGGAAC AA CT T GT AGC AGCTATATAT TTCCCCTTGG 
ACAAAAATTA AGATCATAAA ATAGACCTTG TTGAACATCG TCGATATATA AAGGGGAACC 

2075 2080 2085 2090 2095 2100 2105 2110 2115 2120 2125 2130 
« * • * * * 

TCCCAAGCCT GATACTTTAG CCATCATAAC TCACTAACAG GGAGAAGTAG CTAG TAGCAA 
AGGGTTCGGA CTATGAAATC GGTAGTATTG AGTGATTGTC CCTCTTCATC GATCATCGTT 

2135 2140 2145 2150 2155 2160 2165 2170 2175 2180 2135 2190 

* * * * * * 

TGTGCCTTGA TTGATTAGAT AAAGATTTCT AGTAGGCAGC AAAAGACCAA ATCTCAGTTG 
ACACGGAACT AACTAATCTA TTTCTAAAGA TCATCCGTCG TTTTCTGGTT TAGAGTCAAC 

2195 2200 2205 2210 2215 2220 2225 2230 2235 2240 2245 2250 

* • * * * * 

TTTOCTTCTT GCCATCACTG GTCCAGGTCT TCAGTTTCCG AATCTCTTTC CC7TTCCCCTG 
AAACGAAGAA CGGTAGTGAC CAGGTCCAGA AGTCAAAGGC TTAGAGAAAG GGAAGGGGAC 

2255 2250 2265 2270 2275 2280 2285 2290 2295 2300 2305 2310 

* • * * * * 

TGGTCTATTG TCGCTATGTG ACTTX3CGCTT AATCCAATAT TTTGCCTTTT TTCTATATCA 
ACCAGATAAC AGCGATACAC TGAACGCGAA TTAGGTTATA AAACGGAAAA AAGATATAGT 

2315 2320 2325 2330 2335 2340 2345 2350 2355 2360 2365 2370 

* * ♦ * * * 

AAAAACCTTT ACAGTTAGCA GGGAT G TTC C TTACCGAGGA TTTTTAACCC CCAATCTCTC 
rmTGGAAA TGTCAATCGT CCCTACAAGG AATGGCTCCT AAAAATTGGG GGTTAGAGAG 

2375 2380 2385 2390 2395 2400 2405 2410 2415 2420 2425 2430 

* * * * # * 

FIG. 6D 
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ATAATCGCTA GTGTT7AAAA GGCTAAGAAT AG7GGGGCCC AACCGATGTG GTAGGTGA7A 
TATTAGCGAT CACAAATTTT CCGATTCTTA TCACCCCGGG TTGGCTACAC CATCCACTA? 

2435 2440 2445 2450 2455 2460 2465 2470 2475 2480 2485 2490 

* » • * • • 

AAGAGGCATC TTTTCTAGAG ACACATTCGA CCAGATGAGG ATCC GAAAC G GCAGCCTTTA 
TTCTCCGTAG AAAAGATCTC TCTGTAACCT GGTCTACTCG TAGGCTTTGC CGTCGGAAAT 

2495 2500 2505 2510 2515 2520 2525 2530 2535 2540 2545 2550 

* * • # * * 

CGTTCATCAC CTGCTAGAAC CTCTCGTAGT CCATCACCAT TTCTTGGCAT TGGAATTCTA 
GCAAGTAGTG GACGATCTTG GAGAGCATCA GGTAGTGGTA AAGAACCGTA ACCTTAAGA7 

2555 2560 2565 2570 2575 25S0 2535 2590 2595 2600 2505 2610 
» » * * * * 

CTGGAAAAAA ATACAAAAAG CAAAACAAAA CCCTCAGCAC TGTTACAAGA GGCCATTTAA 
GACCTTTTTT T ATCTTTTT C GTTTTGTTrT GGGA G TCGT G ACAATGTTCT CCGGTAAA1T 

2615 2620 2625 2620 2535 2640 2545 2650 2555 2660 2555 2670 

* • • * * • 

GTATCTTGTG CTTCTTCACT TACCCATTAG CCAGGTTCIXT ATTAGGTTTT GCTTGGGCCT 
CATAGAACAC GAAGAAGTGA ATGGG7AA7C GGTCCAAGAG TAATCCAAAA CGAACCCGGA 

2675 2680 2685 2690 2595 2700 2705 2710 2715 2720 2725 2730 

* » * • * * 

CCCTGGCACT GAACCTTAGG CTTTGTA7GA CAGTGAAGCA GCACTGTGAG TGGTTCAAGC 
GGGACCGTGA CTTGGAATCC GAAACATACT GTCACTTCGT CGTGACACTC ACCAAGTTCG 

2735 2740 2745 2750 2755 2760 2765 2770 2775 2780 2785 2790 

* • ♦ * # * 

ACACTGGAAT ATAAAACAGT CATGGCCTGA GATGCAGGTG ATGCCATTAC AGAACCAAAT 
TGTGACCTTA TATTTTGTCA GTACCGGACT CTACGTCCAC TACGGTAATG TCTTGGTTTA 

2795 2800 2805 2810 2315 2820 2325 2830 2835 2840 2345 2850 

* * * * * * 

CGTCGCACGT ATT G CTGT G T CTCCTCTCAG AGTGACAGTC ATAAATACTG TCAAACAATA 
GCACCGTCCA TAACGACACA GAGGAGAGTC TCACTGTCAG TATTTATGAC AGTTTGTTAT 

2855 2860 2865 2370 2375 2880 2385 2390 2395 2900 2905 2910 

* • * * * * 

AAGGGAGAAT GGTGCTGTTT AAAGTCACAT CCCTGTAAA7 TGCAGAATTC AAAAGTGATT 
TTCCCTCTTA CCACGACAAA TTTCAGTC7A GGGACATTTA ACGTCTTAAG TTTTCACTAA 

2915 2920 2925 2930 2935 2940 2945 2950 2955 2960 2965 2970 

* * * * * • 

ATCTCTTTGA TCTACTTGCC TCATTTCCCT ATCTICTCC C CCACGGTATC C TAAAC TTIA 
TAGAGAAACT AGATGAACGG AGTAAAGGGA TAGAAGAGGG GGTGCCATAG GATTTGAAAT 

2975 2980 2985 2990 2995 3000 3005 3010 3015 3020 2025 3030 

* * » # * * 

GACTTCCCAC TGTTCTGAAA GGAGACATTC CTCTATGTCT GCCTTCGACC ACAGCAAGC C 
CTGAAGGGTG ACAAGACTTT CCTCTGTAAC GAGATACAGA CGGAAGCTGG TGTCGTTCGG 

3035 3040 3045 3050 3055 3060 3065 3070 3075 3080 3085 3090 

* * # * * * 

ATCATCCTCC ATTGCTCCCG GGGACTCAAG AGGAATCTGT TTCTCTGCTG TCAACTTCCC 
TAGTAGGAGG TAACGAGGGC CCCTGAGTTC TCCTTAGACA AAGAGACGAC AGTTGAAGGG 

3095 3100 3105 3110 3115 3120 3125 3130 3135 3140 3145 3150 

* * # * * * 

ATCTGGCTCA GCATAGGGTC ACTTTCCCAT TATGCAAATG GAGATAAAAG CAATTCTGGC 
TAGACCGAGT CGTATCCCAG TGAAACGGTA ATACGTTTAC CTCTATTTTC GTTAAGACCG 

FIG. 6E 
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3155 3160 21S5 2170 3175 2180 2135 3190 2195 3200 2205 2210 
****** 
TGTCCAGGAG GTAATCTGAC CGTTCTATTG TGTGGATGAC CACATAAGAA GGCAATTTTA 
ACAGGTCCTC GATTAGACTG GCAAGATAAC ACACCTACTG GTGTA7TCTT CCGTTAAAAT 

3215 3220 2225 2220 5235 2240 2245 2250 2255 3250 2265 3270 
* * * * • «. 

GTGTATTAAT CA7AGATTAT TATAAACTAT AAACTTAAGG GCAAGGAGTT 7ATTACAATC 
CACATAATTA GTATCTAATA ATATTTGATA TTTGAATTCC CGTTCCTCAA ATAATCTTAC 

3275 3230 3235 2290 2295 2300 2205 2310 3315 3320 2325 3330 

•****» 

TATCTTTATT AAAACAAAAG GGTGTATAGT GTTCACAAAC TGTGAAAATA GTGTAAGAAC 
ATAGAAATAA TTTTGT7TTC CCACATATCA CAAG TGTTTG ACA CTTTTA T CACATTCTTG 

3335 3340 2245 2250 2255 2260 2265 2370 2375 3380 2385 3390 
* * 

7GTACATTGT GAGCTCTGGT TATTTTTCTC TTGTACCATA GAAAAATGTA TAAAAATTAT 
ACATGTAACA CTGGAGACCA ATAAAAAGAG AACA7T2GTAT CTT7TTACAT ATTTTTAATA 

3395 2400 2405 2410 2415 2420 2425 3430 2435 3440 2445 3450 
****** 

CAAAAAGCTA ATGTGCAGGG ATATTGCCTT ATTTGTCTGT AAAAAATGGA GCTCAGTAAC 
GTTTTTCGAT TACACGTCCC TATAACGGAA TAAACAGACA TTTTTTACCT CGAGTCATTG 

3455 3460 2455 3470 3475 3480 2485 3490 3495 
* * * * 

ATAACTGCTT CTTGGAGCTT TGGAATATTT TATCCTGTAT TCTTGTTT (SEQ ID NO:5) 
TATTGACGAA GAACCTCGAA ACCTTATAAA ATAGGACATA AGAACAAA 



FIG. 6F 
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5 10 L5 20 25 30 25 40 45 *0 

CAACA ATG GCG GCT CCG AGC CCG AGC GGT GGC GGC GGC AGC GGC ACC CCC 
GTTCT TAC CGC CGA GGC TCG GGC TCG CCA CCG CCG CCG TCG CCG TGG GGG 
Met Ala Ala Pro Ser Pro Ser Giy Gly Gly Gly Ser Gly Thr Pro 

55 50 65 70 75 30 85 90 95 

* • » » 

GGC CCC GTA GGG TCC CCG GCG CCA GGC CAC CCG GCC GTC AGC AGC ATG 
CCG GGG CAT CCC AGG GGC CGC GGT CCG GTG GGC CGG CAG TCG TCG TAC 
Gly Pro Val Gly Ser Pro Ala Pro Gly His Pro Ala Val Ser Ser Met> 

100 105 110 115 120 125 130 135 140 145 
* * * * • 

CAG GGT AAA CGC AAA GCA CTG AAG TTG AAT TTT GCA AAT CCA CCT TTC 
GTC CCA TTT GCG TTT CGT GAC TTC AAC TTA AAA CGT TTA GGT GGA AAG 

Gin Gly Lys Arg Lys Ala Leu Lys Leu Asn Phe Ala Asn Pro Pro Phe> 

150 155 160 165 170 175 180 185 190 
* * • • 

AAA TCT ACA GCA AGG TTT ACT CTG AAT CCC AAT CCT ACA GGA GTT CAA 
TTT AGA TGT CGT TCC AAA TGA GAC TTA GGG TTA GGA TGT CCT CAA GTT 
Lys Ser Thr Ala Arg Phe Thr Leu Asn Pro Asn Pro Thr Gly Val Gln> 

195 200 205 210 215 220 225 230 235 240 
* * * * # 

AAC CCA CAC ATA GAG AGA CTG AGA ACA CAC AGC ATT GAG TCA TCA GGA 
TTG GGT GTG TAT CTC TCT GAC TCT TGT GTG TCG TAA CTC AGT AGT CCT 
Asn Pro His He Glu Arg Leu Arg Thr His Ser He Glu Ser Ser Gly> 

245 250 255 260 265 270 275 280 285 290 
* * • * * 

AAA CTG AAG ATC TCC CCT GAA CAA CAC TGG GAT TTC ACT GCA GAG GAC 
TTT GAC TTC TAG AGG GGA CTT GTT GTG ACC CTA AAG TGA CGT CTC CTG 
Lys Leu Lys He Ser Pro Glu Gin His Trp Asp Phe Thr Ala Glu Asp> 

295 300 305 310 315 320 325 330 335 

* * * 9 

TTG AAA GAC CTT GGA GAA ATT GGA CGA GGA GCT TAT GGT TCT GTC AAC 
AAC TTT CTG GAA CCT CTT TAA CCT GCT CCT CGA ATA CCA AGA CAG TTG 
Leu Lys Asp Leu Gly Glu He Gly Arg Gly Ala Tyr Gly Ser Val Asn> 



340 345 250 355 360 265 370 375 280 385 
* * * * * 

AAA ATG GTC CAC AAA CCA AGT GGG CAA ATA ATG GCA GTT AAA AGA ATT 
TTT TAC CAG GTG TTT GGT TCA CCC GTT TAT TAC CGT CAA TTT TCT TAA 
Lys Met Val His Lys Pro Ser Gly Gin He Met Ala Val Lys Arg Ile> 

390 395 400 405 410 415 420 425 430 
• * * * * 

CGG TCA ACA GTG GAT GAA AAA GAA CAA AAA CAA CTT CTT ATG GAT TTG 
GCC AGT TGT CAC CTA CTT TTT CTT GTT TTT GTT GAA GAA TAC CTA AAC 
Arg Ser Thr Val Asp Glu Lys Glu Gin Lys Gin Leu Leu Met Asp Leu> 

435 440 445 450 455 460 465 470 475 480 
***** 

GAT GTA GTA ATG CGG AGT AGT GAT TGC CCA TAC ATT GTT CAG TTT TAT 
CTA CAT CAT TAC GCC TCA TCA CTA ACG GGT ATG TAA CAA GTC AAA ATA 
Asp Val Val Met Arg Ser Ser Asp Cys Pro Tyr He Val Gin Phe iyr> 



FIG. 7A 
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485 490 495 500 505 510 515 520 525 530 
* * » * 

GGT GCA CTC TTC AGA GAG GGT GAC TGT TGG ATC TGT ATG GAA CTC ATG 
CCA CGT GAG AAG TCT CTC CCA CTG ACA ACC TAG ACA TAG CTT GAG TAC 
Gly Ala Leu Phe Arg Glu Gly Asp C/s Trp lie Cys Met Glu Leu Met> 

535 540 545 550 555 560 565 570 575 
* * » * 

TCT ACC TCG TTT GAT AAG TTT TAC AAA TAT GTA TAT AGT GTA TTA GAT 
AGA TGG AGC AAA CTA TTC AAA ATG TTT ATA CAT ATA TCA CAT AAT CTA 
Ser Thr £er Phe Asp Lys Phe Tyr Lys Tyr Val Tyr Ser Val Leu Asp> 

580 585 590 595 600 605 610 615 620 625 




Vai> 

630 535 640 645 650 655 660 665 570 

AAA GCA CTA AAC CAC TTA AAA GAA AAC TTG AAA ATT ATT CAC AGA GAT 
TTT CGT GAT TTG GTG AAT TTT CTT TTG AAC TTT TAA TAA GTG TCT CTA 
Lys Ala Leu Asn His Leu Lys Glu Asn Leu Lys He He His Arg Asp> 

675 680 685 690 695 700 705 710 715 720 

ATC AAA CCT TCC AAT ATT CTT CTG GAC AGA AGT GGA AAT ATT AAG CTC 
TAG TTT GGA AGG TTA TAA GAA GAC CTG TCT TCA CCT TTA TAA TTC GAG 
He Lys Pro Ser Asn He Leu Leu Asp Arg Ser Gly Asn He Lys Leu> 

725 730 735 740 745 750 755 760 765 770 

* * ♦ 

TGT GAC TTC GGC ATC AGT GGA CAG CTT GTG GAC TCT ATT GCC AAG ACA 
ACA CTG AAG CCG TAG TCA CCT GTC GAA CAC CTG AGA TAA CGG TTC TGT 
Cys Asp Phe Gly He Ser Gly Gin Leu Val Asp Ser He Ala Lys Thr> 

775 780 735 790 795 300 805 810 915 
* * * * 

AGA GAT GCT GGC TGT AGG CCA TAC ATG GCA CCT GAA AGA ATA GAC CCA 
TCT CTA CGA CCG ACA TCC GGT ATG TAC CGT GGA CTT TCT TAT CTG GGT 
Arg Asp Ala Gly Cys Arg Pro Tyr Met Ala Pro Glu Arg He Asp Pro> 

820 825 330 935 840 845 850 855 860 865 
* * * * * 

AGC GCA TCA CGA CAA GGA TAT GAT GTC CGC TCT GAT GTC TGG AGT TTG 
TCG CGT AGT GCT GTT CCT ATA CTA CAG GCG AGA CTA CAG ACC TCA AAC 
Ser Ala Ser Arg Gin Gly Tyr Asp Val Arg Ser Asp Val Trp Ser Leu> 

870 875 880 885 890 895 900 905 910 
* * * * • 

GGG ATC ACA TTG TAT GAG TTG GCC ACA GGC CGA TTT CCT TAT CCA AAG 
CCC TAG TGT AAC ATA CTC AAC CGG TGT CCG GCT AAA GGA ATA GGT TTC 
Gly He Thr Leu Tyr Glu Leu Ala Thr Gly Arg Phe Pro Tyr Pro Lys> 

915 920 925 930 935 940 945 950 955 960 
* * * * # 

TGG AAT AGT GTA TTT GAT CAA CTA ACA CAA GTC GTG AAA GGA GAT CCT 
ACC TTA TCA CAT AAA CTA GTT GAT TGT GTT CAG CAC TTT CCT CTA GGA 
Trp Asn Ser Val Phe Asp Gin Leu Thr Gin Val Val Lys Gly Asp Pro 

965 970 975 980 985 990 995 1000 1005 1010 
* * * * * 

FIG. 7B 
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CCG CAG CTG AGT AAT TCT GAG GAA AGG GAA TTC TCC CCG AGT TTC ATC 

GGC GTC GAC TCA TTA AGA CTC CTT TCC CTT AAG AGG GGC TCA AAG TAG 

Pro Gin Leu Ser Asn Ser Glu Giu Arg Giu Fhe Ser Pro Ser Phe Ile> 

1015 1020 1025 1030 1035 1040 1045 1050 1055 
* # » * 

AAC TTT GTC AAC TTG TGC CTT ACG AAG GAT GAA TCC AAA AGG CCA AAG 
TTG AAA CAG TTG AAC ACG GAA TGC TTC CTA CTT AGG TTT TCC GGT TTC 
Asn Phe Val Asn Leu Cys Leu Thr Lys Asp Glu Ser Lys Arg Pro Lys> 



1060 1065 1C70 1075 1080 1085 1090 1095 1100 Li 



105 



TAT AAA GAG CTT CTG AAA CAT CCC TTT ATT TTG ATC TAT GAA GAA CGT 
ATA TTT CTC GAA GAC TTT GTA GGG AAA TAA AAC TAC ATA CTT CTT GCA 
Tyr Lys Glu Leu Leu Lys His Pro Phe He Leu Met Tyr Glu Glu Arg> 

1110 1115 1120 1125 1130 1135 1140 1145 1150 



GCC GTT GAG GTC GCA TGC TAT GTT TCT AAA ATC CTG GAT CAA ATG CCA 
CGG CAA CTC CAG CGT ACG ATA CAA ACA TTT TAG GAC CTA GTT TAC GGT 
Ala Val Glu Val Ala Cys Tyr Val Cys Lys He Leu Asp Glu Met Pro> 

1155 1160 1165 1170 1175 1180 1185 1190 1195 1200 
* * * * * 

GCT ACT CCC AGC TCT CCC ATG TAT GTC GAT TGATAT CGYTGCTACA 
CGA TGA GGG TCG AGA GGG TAC ATA CAG CTA ACTATA GCRACGATGT 
Ala Thr Pro Ser Ser Pro Met T/r Val Asp> ( SE q id nO:8) 

1205 1210 1215 1220 1225 1230 1235 1240 1245 1250 1255 1260 

* ***** 
TCAGACTCTA GAAAAAAGGG CTGAGAGGAA GCAAGACGTA AAGAATTTTC ATCCCGTATC 
AGTCTGAGAT CTTTTTTCCC GACTCTCCTT CGTTCTGCAT TTCTTAAAAG TAGGGCATAG 

1265 1270 1275 1280 1285 1290 1295 1300 1305 1310 1315 1320 

* * * * * * 
ACAGTGTTTT TATTGCTCGC CCAGACACCA TGTGCAATAA GATTGGTGTT C GTTTC CATC 
TGTCACAAAA ATAACGAGCG GGTCTGTGGT ACACGTTATT CTAACCACAA GCAAAGGTAG 

1325 1330 1335 1340 1345 1350 1355 1360 1365 1370 1375 1380 

* ***** 

ATGTCTGTAT ACTCCTGTCA CCTAGAACGT GCATC CTTGT AATACCTGAT TGATCACACA 
TACAGACATA TGAGGACAGT GGATCTTGCA CGTAGGAACA TTATGGACTA ACTA G TGTGT 

1385 1390 1395 1400 1405 1410 1415 1420 1425 1430 1435 1440 

* * * * * * 
GTGTTAGTGC TGGTCAGAGA GACCTCATCC TG CTCTTTTG TGATCAACAT ATTCATGAAA 
CACAATCACG ACCAGTCTCT CTGGAGTAGG ACGAGAAAAC ACTA CTTOT A TAAGTACTTT 

1445 1450 1455 1460 1465 1470 1475 1480 1485 1490 1495 1500 

* ***** 
TCTGGAAGTC AGTACGATCA AGTTGTTGAC TGTGATTAGA TCACATCTTA AATTCATTTC 
ACACCTTCAG TCATGCTAGT TCAACAACTG ACACTAATCT AGTGTAGAAT TTAAGTAAAG 

1505 1510 1515 1520 1525 1530 1535 1540 1545 1550 1555 1560 
****** 

TA ^CTCAAA ACCTGGAGAT GCAGCTACTG GAATGGTGTT TTCT CA GACT TCCAAATCCT 
ATCTGAGTTT TGGACCTCTA CGTCGATGAC CTTACCACAA AACA G TC TC A AGGTTTAGGA 

1565 1570 1575 1580 1585 1590 1595 1600 1605 1610 1615 1620 

* * * * * * 

GGAAGGACAC AGTGATGAAT GTACTATATC TGAACATAGA AACTCGGGCT TGAGTGAGAA 
CCTTCCTGTG TCACTACTTA CATCATATAG ACITCTATCT TTGAGCCCGA ACTCACTCTT 

FIG, 7C 



SUBSTITUTE SHEET (RULE 26) 



WO 99/02547 



PCT/US98/14101 



21/54 

1625 1630 1535 1640 1645 1550 1555 1560 1565 1670 1675 1680 

gg£TS£ ff?f5™« I^CTSGAGCT GGGAGACAaI GGAGGAATl4 

CTCGA^CGTG --^GTTGCTC TGTGTAACGG AAGACCTCGA CCCTCTCTTT CCTCCTTAAA 

1685 1590 1595 1700 1705 1710 1715 1720 1725 1730 173S l 740 

™£Y£* S^S? 30 ™ ~GA7TACTG ATG7GATATT CTGTTGCTTT ACAGT7ACAG 
TGAAACLaAGT GGTTCACG7T ATCTAATGAC TACACTA7AA GACAACGAAA TGTCAATGTC 

1745 1750 1755 1760 1765 1770 1775 1780 1785 1790 1795 1800 

USaCaS ^^CTCT TTGAAATA7C ATGTTAAATT 

"Av- i n wiAAC , --v^CT™- ^wo«GTCGG7 T7AAAGGACA AACTTTATAG TACAATTTAA 

1805 1310 1315 1320 1325 1830 1335 1840 1345 1850 1355 1860 

^£g£FF ^^"^ 7TGCGTTCAA AGAGGTGAAC A7TAAAATAT 

TCTTACrr^ rt^rtGrvrijiTGG T7TTTGGTAC AACGCAAGTT 7 > CTCCACTTG 7AATTTTATA 

1865 1370 1375 1380 1385 1890 1395 1900 1905 1910 1915 i 920 

AGAGACAGGA CAGAATGTGT TCTTTTCTCC TCTACCAGTC CTATITT* 
TCTCTGTCCT GTCTTACACA AGAAAAGAGG AGa£gtS£ GAtSaGT* Sggjgg 

1925 1930 1935 1940 1945 1950 1955 1960 1965 1970 1975 1980 

tcaggactct gccacttgtc aaagaaggtc ctgatcctaa gaatttttca ^cicagaat 
agtcctcaga cggtcaacag tttcttccac gactaggait cttaaaaIS SSSSJI 

1985 1990 1995 2000 2005 2010 2015 2020 2025 2030 2035 2040 

CCCAACTTCA TGTTCCACCT GCCACAAACC ACCAGGACTG AAAGAAGAAA 
AGCCACACGA CGGTTGAACT ACAAGG7GGA CG GTGTTT5 G TGGTCCTGAC T—cKgffr' 

2045 2050 2055 2060 2065 2070 2075 2080 2085 2090 2095 2100 

^SZ^ST TACAGATGrr 7TTAATTCTA GTATTTTATC 7GGAACAACT 
TGTCATGTCT TCCG7TTCAA A7GTCTACAA AAATTAAGAT CATAAAATAG ACCT7GTTGA 

210S 2110 2115 2120 212S 2130 213S 2140 2145 2150 2155 2160 

aSSSS ™™THHS x^™^ 0 aagcctsata CTTTAGCCAT CATAACTCAc" 
ACATCGTCGA TATATAAAGG GGAACCAGGG 7TCGGACTAT GAAATCGGTA GTATTGAGTG 

2165 2170 2175 2180 2185 2190 2195 2200 2205 2210 2215 2220 

* * * * 

l^^S^B. AAGTAGCTAG TAGCAATGTG CCTTGATTCA TTAGATAAAG ATTTCTAGTA 
ATTCTCCCTC TTCATCGATC ATCGTTACAC GGAACTAACT AATCTATTTC 7AAAGA?5t* 

2225 2230 2235 2240 2245 2250 2255 2260 2265 2270 2275 2280 

ggggg^ CAGTTGTr^ Cncrr GC CA TCACTGGTCC AGGTCITCAG 

CCGTCGTXTT CTGGTTTAGA GTCAACAAAC GAAGAACGGT AGTCACCAGG TCCAGAAGTC 

2285 2290 2295 2300 2305 2310 2315 2320 232S 2330 2335 2340 

^TCCGAATC PCTTTCCCrr CCCCTGTGGT CTATTGTCGC TATGTGACTT GCGCTTAATC 
AAAGGCTTAG AGAAAGGGAA GGGGACACCA GATAACAGCG ATaEaSgAA SgAjSSS 

2345 2350 2355 2360 2365 2370 2375 2380 2385 2390 2395 2400 
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* * • 

CAATATTTTG CCTTTTTTCT ATATCAAAAA ACCTTTACAG TTAGCAGGGA r^r^ir 
GTTATAAAAC GGAAAAAAGA TATAGTTTTT TGGAAATGTC AA^Sg^ 

2405 2410 2413 2420 2425 2430 2435 2440 2445 243C 2455 2460 

Sc™S IH^SSS ™ AGTG ^ TTAAAAGGCT AAGAATAGTG 

^CTCu.«AAA "^u^,. rtoAGAGTATT AGCGATCACA AATTTTCCGA TTCTTATCAC 

2465 2470 2475 2480 2485 2490 2495 2500 2505 2510 2515 2520 

^TSTSJ^ 2^ TAAAGA GGCATCTTrr CTAGAGACAC ATTCGACCAG 
CCCGGoTTG^ u.^w^CATC CACTATTTCT CCGTAGAAAA GATCTCTGTG TAACCTGGTC 

2525 2530 2535 2540 2545 2550 2555 2550 2565 2570 2575 2580 

£TSS2?SS 5^2^ EE7F TACCrrr ^TCACCTOT TAGAACCTCT CGTAGPCCAT 
.ACTC-^ uTTTC-.wGTC GGAAA7GCAA GTAGTGGACG ATCTTGGAGA GCATCAGGTA 

2585 2550 2395 2500 2505 2510 2515 2620 2625 2630 2535 2640 

gtgSaaIS 2^aJS£ -TEzSE ^^ TAC ^^aaagcaaI ACAAAACCCT 

biu.^Gn «Cv.GT«ACC. .^AGATGACC - .vi-i-ivATG I ITTICGTTT TGTTTTGGGA 
2645 2650 2655 2660 2665 2670 2675 2680 2585 2690 2695 2700 

SSSS^T ££5SS°°E A ?r rAAGTAT ^' ^ l^ TTCACTTACC CATTAGCCAG 
GTCGTCW^A 7GTTC7CCGG TAAATTCATA GAACACGAAG AAGTGAATGG GTAATCGGTC 

2705 2710 2715 2720 2725 2730 2735 2740 2745 2750 2755 2760 

* * * » * 

GTTCTCATTA GGTTTTGCTT GGGCCTCCCT GGCACTGAAC ~™^GTJ~r~> ~-ZTririi~r 
CAAGAGTAAT CCAAAACGAA CCCGGAGGGA C^SSS GA^SSI cS 

2765 2770 2775 2780 2785 2790 2795 2800 2805 2810 2315 2820 

£££SS££ FH^^T I^y"^ ^W™** AACAGTCATG 3CCTGAGATG 
CTTCGTCGTG rt w*CTC«CCA AGTTCGTGTG ACCTTATATT TTGTCAGTAC CGGACTCTAC 

2825 2830 2835 2840 2845 2350 2355 2860 2865 2870 2375 2380 

SSSSS ^S A S^ S^i^P GCACGTATIG CTGT3TCTCC 7CTCAGAGTG 
GTCCACTACG GTAATGTCTT GGTTTAGCAC CGTGCATAAC GACACAGAGG AGAGTCTCAC 

2885 2890 2895 2900 2905 2910 2915 2920 2925 2930 2935 2940 

* • • • « 

ACAGTCATAA ATACTGTCAA ACAATAAAGG GAGAATGGTG CTGT7TAAAG TCACATCCCT 
TGTCAGTATT TATGACAGTT TGTTATTTCC CTCTTACCAC GACAAATTTC AG^AGgS 

2945 2950 2955 2960 2965 2970 2975 2980 2985 2990 2995 3000 

StoIIS ^HS^ SSSEF* CTITCATCTA CTCGCCTCAT 7TCCCTATCT 
CATTTAACGT CTTAAGTTTT CACTAATAGA GAAACTAGAT GAACGGAGTA AAGGGATAGA 

3005 3010 301S 3020 3025 3030 3035 3040 3045 3050 3055 3060 

aSSgSS SSSSSS JSP 0 *" TGCCACTGTT CTGAAAGGAG ACATTGCTCT 
AGAGG««GTC CCATAGGATT TGAAATCTCA AGGGTGACAA GA CTTTCCTC TGTAACGAGA 

3065 3070 3075 3080 3085 3090 3095 3100 3105 3110 3115 3120 

ATGTCTGCCT TCGACCACAG CAAGCCATCA TCCICCATT3 CTCCCGGGGA CTCAAGAGGA 

FIG. 7E 
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TACAGACGGA AGCTGGTGTC G7TCGG7AGT AGGAGGTAAC GAGGGCZCC7 GAGTTCTCCT 

3125 3130 2135 3140 3145 3150 3155 3150 3155 3170 3175 3180 
****** 

ATCTGTTTCT CTGCTGTCAA CTTCCCATCT GGCTCAGCAT AGGGTCACTT 7GCCATTATG 
TAGACAAAGA GACGACAGTT GAAGGGTAGA CCGAGTCGTA TCCCAGTGAA ACGGTAATAC 

3185 3190 3135 3200 2205 3210 2215 3220 3225 3230 2235 2240 
****** 

CAAATGGAGA TAAAAGCAAT TCTGGCTGTC CAGGAGCTAA TCTGACCGTT CTATTGTGTG 
GTTTACCTCT ATTTTCGTTA AGACCGACAG GTCCTCGATT AGACTGGCAA GATAACACAC 

3245 3250 2255 2260 2265 3270 2275 3280 3285 3290 2295 2300 
****** 

GATGACCACA TAAGAAGGCA ATTTTAGTGT ATTAATCATA GATTATTATA AACTA7AAAC 
CTACTGGTGT ATTCTTCCGT TAAAATCACA TAATTAGTAT CTAATAATA7 TTGATATTTG 

3305 3310 3215 2320 2225 2330 3335 3340 3345 3350 3355 3360 
****** 

TTAAGGGCAA GGAGTTTATT ACAATGTATC TTTATTAAAA CAAAAGGGTG TATAGTGTTC 
AATTCCCGTT CCTCAAATAA TGTTACATAG AAATAATTTT GTTTTCCCAC A7ATCACAAG 

3365 3370 2275 2380 2285 3390 2395 3400 3405 341C 2415 2420 
* * * * * * 

ACAAACTGTG AAAA7AGTG7 AAGAACTGTA CATTGTGAGC TCTGGTTATT . rTCTCITGT 
TGTTTGACAC 77TTATCACA T TCTT G ACAT GTAACACTCG AGACCAATAA AAAGAGAACA 

3425 3430 3435 3440 3445 3450 3455 3460 3465 3470 3475 3480 
****** 

ACCATAGAAA AATGTATAAA AATTATCAAA AAGCTAATGT GCAGGGATAT 7GCCT7ATTT 
TGGTATCTTT TTACATATTT 1TAATAGTTT TTCGATTACA CGTCCCTATA ACGGAATAAA 

3485 3490 3495 3500 3505 3510 2515 3520 3525 3530 3535 3540 
****** 

GTCTGTAAAA AA7GGAGCTC AGTAACATAA CTGCTTCTTG GAGCTTTGGA ATATTTTATC 
CAGACATTTT TTACCTCGAG TCATTGTATT GACGAAGAAC C7CGAAACC7 7A7AAAA7AG 

3545 3550 
* 

CTGTATTCTT GTTT (SEQ ID NO; 7) 
GACATAAGAA CAAA 
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5 10 15 20 25 30 35 40 45 50 

* * « * * 

CTCCCAACA ATG GCG GCT CCG AGC CCG AGC GGC GGC GGC GGC TCC GGG GGC 
GAGGGTTGT TAC CGC CCA GGC TCG GGC TCG CCG CCG CCG CCG AGG CCC CCG 
Met Ala Ala Pro Ser Pro Ser Gly Gly Gly Giy Ser Gly Gly> 

55 60 55 70 75 30 85 90 95 

* * * * 

GGC AGC GGC AGC GGC ACC CCC GGC CCC GTA GGG TCC CCG GCG CCA GGC 
CCG TCG CCG TCG CCG TGG GGG CCG GGG CAT CCC AGG GGC CGC GGT CCG 
Gly Ser Gly Ser Gly Thr Pro Gly Pro Val Gly Ser Pro Ala Pro Gly> 

LOO 105 110 115 120 125 130 135 140 145 

* * * » * 

CAC CCG GCC GTC AGC AGC ATG CAG GGT AAA CGC AAA GCA CTG AAG TTG 
GTG GGC CGG CAG TCG TCG TAC GTC CCA TTT Gs-\j TTT Co l GnC TTC /-^AC 
His Pro Ala Val Ser Ser Met Gin Gly Lys Arg Lys Ala Leu Lys Leu> 

150 155 150 155 170 175 ISO 125 190 195 

# • » » * 

AAT TTT GCA AAT CCA CCT TTC AAA TCT ACA GCA AGG TTT ACT CTG AAT 
TTA AAA CCT TTA GGT GGA AAG TTT AGA TGT CGT TCC AAA TGA GAC TTA 
Asn Phe Ala Asn Pro Pro Phe Lys Ser Thr Ala Arg Phe Thr Leu Asn> 

200 205 210 215 220 225 230 235 240 

# # * * * 

CCC AAT CCT ACA GGA GTT CAA AAC CCA CAC ATA GAG AGA CTG AGA ACA 
GGG TTA GGA TGT CCT CAA GTT TTG GGT GTG TAT CTC TCT GAC TTT TGT 
Pro Asn Pro Thr Gly Val Gin Asn Pro His lie Glu Arg Leu Arg Thr> 

245 250 255 260 265 270 275 280 285 290 
. • * * * 

CAC AGC ATT GAG TCA TCA GGA AAA CTG AAG ATC TCC CCT GAA CiJV CAC 
GTG TCG TAA CTC AGT AGT CCT TTT GAC TTC TAG AGG GGA CTT GTT GTG 
His Ser lie Glu Ser Ser Gly Lys Leu Lys He Ser Pro Glu Gin His> 

295 300 305 310 315 320 325 330 235 

* * * * 

TGG GAT TTC ACT GCA GAG GAC TTG AAA GAC CTT GGA GAA ATT GGA CGA 
ACC CTA AAG TGA CGT CTC CTG AAC TTT CTG GAA CCT CTT TAA CCT GCT 
Trp Asp Phe Thr Ala Glu Asp Leu Lys Asp Leu Giy Glu He Giy Arg> 

340 345 350 355 360 365 370 375 380 385 

* # * * * 

GGA GCT TAT GGT TCT GTC AAC AAA ATG GTC CAC AAA CCA AGT GGG CAA 
CCT CGA ATA CCA AGA CAG TTG TTT TAC CAG GTG TTT GGT TCA CCC GTT 
Gly Ala Tyr Gly Ser Val Asn Lys Met Val His Lys Pro Ser Giy Gln> 

390 395 400 405 410 415 420 425 430 435 

* * * * * 

ATA ATG GCA GTT AAA AGA ATT CGG TCA ACA GTG GAT GAA AAA GAA CAA 
TAT TAC CGT CAA TTT TCT TAA GCC AGT TGT CAC CTA CTT TTT CTT GTT 
He Met Ala Val Lys Arg lie Arg Ser Thr Val Asp Glu Lys Glu Gln> 

440 445 450 455 460 465 470 475 480 

* » * * * 

AAA CAA CTT CTT ATG GAT TTG GAT GTA GTA ATG CGG AGT AGT GAT TGC 
TTT GTT GAA GAA TAC CTA AAC CTA CAT CAT TAC GCC TCA TCA CTA ACG 
Lys Gin Leu Leu Met Asp Leu Asp Val Val Met Arg Ser Ser Asp Cys> 

FIG. 8A 
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485 490 495 500 505 510 515 520 525 530 

* » * * * 

CCA 7AC ATT GTT CAG TTT TAT GGT GCA CTC TTC AGA GAG GGT GAC TGT 
GGT ATG TAA CAA GTC AAA ATA CCA CGT CAG AAG TCT CTC CCA CTG ACA 
Pro Tyr He Val Gin Phe Tyr Gly Ala Lsu Phe Arg Glu Giy Asp Cys> 

535 540 545 550 555 550 565 570 575 

• * * * 

TGG ATC TGT ATG CAA CTC ATG TCT ACC TCG TTT GAT AAG TTT TAC AAA 
ACC TAG ACA TAC CTT GAG TAC AGA TGG AGC AAA CTA TTC AAA ATG TTT 
Trp He Cys Mec Glu Leu Mec Ser Thr Ser Phe Asp Lys Phe Tyr Lys> 

530 585 590 595 500 605 510 615 620 525 

* * * * * 

TAT GTA TAT AGT GTA TTA GAT GAT GTT ATT CCA GAA GAA ATT TTA GGC 
ATA CAT ATA TCA CAT AAT CTA CTA CAA TAA GGT CTT CTT TAA AAT CCG 
Tyr Val Tyr Ser Val leu Asp Asp Val He Pro Glu Glu He Leu Gly> 

530 535 540 545 650 655 650 665 570 575 

* w » * * 

AAA ATC ACT TTA GCA ACT C-TG AAA GCA CTA AAC CAC TTA AAA GAA AAC 
TTT TAG TGA AAT CGT TGA CAC TTT CGT GAT TTG GTG AAT TTT CTT TTG 
Lys He Thr Leu Ala Thr Val Lys Ala Leu Asn His Leu Lys Glu Asn> 

680 685 590 695 700 "35 710 715 ^20 

* * * * * 

TTG AAA ATT ATT CAC AGA GAT ATC AAA CCT TCC AAT ATT CTT CTG GAC 
AAC TTT TAA TAA GTG TCT CTA TAG TTT GGA AGG TTA TAA GAA GAC CTG 
Leu Lys He He His Arg Asp He Lys Pro Ser Asn He Leu Leu Asp> 

725 730 735 740 745 750 755 760 765 770 

* * * * * 

AGA AGT GGA AAT ATT AAG CTC TGT GAC TTC GGC ATC AGT GGA CAG CTT 
TCT TCA CCT TTA TAA TTC GAG ACA CTG AAG CCG TAG TCA CCT GTC GAA 
Arg Ser Gly Asn He Lys Leu Cys Asp Phe Gly He Ser Gly Gin Leu> 

775 780 785 790 795 300 805 310 315 

* • * * 

GTG GAC TCT ATT GCC AAG ACA AGA GAT C-CT GGC TGT AGG CCA TAC ATG 
CAC CTG AGA TAA CGG TTC TGT TCT CTA CGA CCG ACA TCC GGT ATG TAC 
Val Asp Ser He Ala Lys Thr Arg Asp Ala Gly Cys Arg Pro Tyr Met> 

820 825 830 335 340 845 850 855 860 865 

* * * * * 

GCA CCT GAA AGA ATA GAC CCA AGC GCA TCA CGA CAA GGA TAT GAT GTC 
CGT GGA CTT TCT TAT CTG GGT TCG CGT AGT GCT GTT CCT ATA CTA CAG 
Ala Pro Glu Arg He Asp Pro Ser Ala Ser Arg Gin Gly Tyr Asp Val> 

870 875 880 885 890 895 900 905 910 915 

* * * * * 

CGC TCT GAT GTC TGG AGT TTG GGG ATC ACA TTG TAT GAG TTG GCC ACA 
GCG AGA CTA CAG ACC TCA AAC CCC TAG TGT AAC ATA CTC AAC CGG TGT 
Arg Ser Asp Val Trp Ser Leu Gly He Thr Leu Tyr Glu Leu Ala Thr> 

920 925 930 935 940 945 950 955 360 

* • * * * 

GGC CGA TTT CCT TAT CCA AAG TGG AAT AGT GTA TTT GAT CAA CTA ACA 
CCG GCT AAA GGA ATA GGT TTC ACC TTA TCA CAT AAA CTA GTT GAT TGT 
Gly Arg Phe Pro Tyr Pro Lys Trp Asn Ser Val Phe Asp Gin Leu Thr> 

965 970 975 980 985 990 995 1000 1005 1010 

* # » * * 
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CAA GTC GTG AAA GGA GAT CCT CCG CAG CTG AGT AAT TCT GAG GAA AGG 
GTT CAG CAC TTT CCT CTA GGA GGC GTC GAC TCA TTA AGA CTC CTT TCC 
Gin Val Val Lys Gly Asp Fro Pro Gin Leu Ser Asn Ser Glu Giu Arg> 

1015 1020 1225 1030 1035 1040 1045 1050 1055 
* » * # 

GAA TTC TCC CCG AGT TTC ATC AAC TTT GTC AAC TTG TGC CTT ACG AAG 
CTT AAG AGG GGC TCA AAG TAG TTG AAA CAG TTG AAC ACG GAA TGC TTC 
Glu Phe Ser Pro Ser Phe He Asn Phe Val Asn Leu Cys Leu Thr Lys> 

1060 1065 1070 1075 1080 1085 1090 1095 1100 1105 
* * • * * 

GAT GAA TCC AAA AGG CCA AAG TAT AAA GAG CTT CTG AAA CAT CCC TTT 
CTA CTT AGG TTT TCC GGT TTC ATA TTT CTC GAA GAC TTT GTA GGG AAA 
Asp Glu Ser Lys Arg Pro Lys Tyr Lys Glu Leu Leu Lys His Fro Phe> 

1110 1115 1120 1125 1130 1135 1140 1145 1150 1155 
• * • « • 

ATT TTG ATG TAT GAA GAA CGT GCC GTT GAG GTC GCA TGC TAT GTT TGT 
TAA AAC TAG ATA CTT CTT GCA CGG CAA CTC CAG CGT ACG ATA CAA ACA 
He Leu xMec Tyr Glu Glu Arg Ala Val Glu Val Ala Cys Tyr Val Cys> 

1160 1165 1170 1175 1180 1185 1190 1195 1200 
* ♦ * # * 

AAA ATC CTG GAT CAA ATG CCA GCT ACT CCC AGC TCT CCC ATG TAT GTC 
TTT TAG GAC CTA GTT TAC GGT CGA TGA GGG TCG AGA GGG TAC ATA CAG 
Lys He Leu Asp Gin Met Pro Ala Thr Pro Ser Ser Pro Met Tyr Val> 

1205 1210 1215 1220 1225 1230 1235 1240 1245 1250 1255 1260 
• * * • * • 

GAT TGAT ATCGCTGCTA CATCAGACTC TAGAAAAAAG GGCTGAGAGG AAGCAAGACG 
CTA ACTA TAGCGACGAT GTAGTCTGAG ATCTTTTTTC CCGACTCTCC TTCGTTCTGC 
Asp> (SEQ ID N0:i0) 

1265 1270 1275 1280 1285 1290 1295 1300 1305 1310 1315 1320 

• * # * * * 

TAAAGAATTT TCATCCCGTA TCACAGTGTT TTTATTGCTC GCCCAGACAC CATGTGCAAT 
ATTTCTTAAA AGTAGGGCAT AGTGTCACAA AAATAACGAG CGG G TCT G TG GTACACGTTA 

1325 1330 1335 1340 1345 1350 1355 1360 1365 1370 1375 1380 

• * * * * • 

AAGATTGGTG TTCGTTTCCA TCATGTCTGT ATACTCCTGT CACCTAGAAC GTGCATCCTT 
TTCTAACCAC AAGCAAAGGT AGTACAGACA TATGAGGACA GTGGATCTTG CACGTAGGAA 

1385 1390 1395 1400 1405 1410 1415 1420 1425 1430 1435 1440 

• * ♦ * * * 

GTAATACCTG ATTGATCACA CAGTGTTAGT GCTGGTCAGA GAGACCTCAT CCTGCTCTTT 
CATTATGGAC TAACTAGTGT GTCACAATCA CGACCAGTCT CTCTGGAGTA GGACGAGAAA 

1445 1450 1455 1460 1465 1470 1475 1480 1485 1490 1495 1500 

* * * * * * 

TGTGATGAAC ATATTCATGA AATGTGGAAG TCAGTACGAT CAAGTTGTTG ACTGTGATTA 
ACACTACTT3 TATAAGTACT TTACACCTTC AGTCATGCTA GTTCAACAAC TGACACTAAT 

1505 1510 1515 1520 1525 1530 1535 1540 1545 1550 1555 1560 

* * * • * * 

GATCACATCT TAAATTCATT TCTAGACTCA AAACCTGGAG ATGCAGCTAC TGGAATGGTG 
CTAGTGTAGA ATTTAAGTAA AGATCTGAGT TTTGGACCTC TACGTCGATG ACCTTACCAC 

1565 1570 1575 1580 1585 1590 1595 1600 1605 1610 1S15 1620 

* * * * * ♦ 

TTTTGTCAGA CTTCCAAATC CTGGAAGGAC ACAGTGATGA ATGTACTATA TCTGAACATA 

FIG. 8C 
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AAAACAGTCT GAAGGTTTAG GACCTTCCTG TGTCACTACT TACATGATAT AGACTTGTA7 

1625 1630 1635 1640 1645 1650 1655 1660 1665 1670 1575 1680 

* * * * * * 

GAAACTCGGG CTTGAGTGAG AAGAGCTTGC ACAGCCAACG AGACACATTG CCTTGTGGAG 
CTTTG AGCCC GAACTCACTC TTCTCGAACG TGTCGGTTGC TCTGTGTAAC GGAAGACCTC 

1685 1690 1695 1700 1705 1710 1715 1720 1725 1730 1735 1740 

* * * * * * 

CTGGGAGACA AAGGAGGAAT TTACTTTCT? CACCAAG7GC AATAGATTAC TGA7GTGATA 
GACCCTCTGT TTCCTCCTTA AATGAAAGAA G7GGTTCACG TTATCTAATG ACTACACTAT 

1745 1750 1755 1760 1765 1770 1775 1780 1785 1790 1795 1800 

* * * * * « 

TTCTGTT GC T TTACAGTTAC AGTTGATGTT TGGGGATCGA TGTGCTCAGC CVATTTCCT 
AAGACAACGA AA7GTCAATG TCAACTACAA ACCCCTAGCT ACACGAGTCG GTTTAAAGGA 

1305 1810 1815 1320 1325 1330 1335 1340 1845 1350 1355 1860 

* * • * * * 

GTTTGAAATA TCATGTTAAA TTAGAA7GAA TTTATCTTTA CCAAAAACCA ^w^CCrT?Z 
CAAACTTTAT AG7ACAATTT AATCTTACTT AAATAGAAAT GGTTTTTGGT ACAACGCAAG 

1865 1870 1875 1880 1885 1890 1395 1900 1905 1910 1915 1920 

* * * * * * 

AAAGAGGTGA ACATTAAAAT ATAGAGACAG GACAGAATGT GTTCTTTTCT CCTCTACCAG 
TTTCT CCACT TGTAATTTTA TATCTCTGTC CTGTCTTACA CAAGAAAAGA GGAGATGGTC 

1925 1930 1935 1940 1945 1950 1955 1960 1965 1970 1975 1980 
« * * * * • 

TCCTATTTTT CAATGGGAAG ACTCAGGAGT CTGCCACTTG TC AAAGAAGG TGCTGATCCT 
AGGA7AAAAA GTTACCCTTC TGAGTCCTCA GACGGTGAAC AGTTTCTTCC ACGACTAGGA 

1985 1990 1995 2000 2005 2010 2015 2020 2025 2030 2035 2040 
****** 

AAGAATTTTT CATTCTCAGA ATTCGGTGTG CTGCCAACTT GATGTTCCAC CIT3CC ACAAA 
TTCTTAAAAA GTAAGAGTCT TAAGCCACAC GACGGTTGAA CTACAAGGTG GACGGTGTTT 

2045 2050 2055 2060 2065 2070 2075 2080 2085 2090 2095 21C0 

* * * * * * 

CCACCAGGAC TGAAAGAAGA AAACAGTACA GAAGGCAAAG TTTACAGATG TTTITAATTC 
GGT G GT CC T G ACTTTCTTCT TTTGTCATGT CTTCCGTTTC AAATGTC7AC AAAAATTAAG 

2105 2110 2115 2120 2125 2130 2135 2140 2145 2150 2155 2160 
****** 

TAGTATTTTA TCTGGAACAA CTTGTAGCAG CTATATATTT CCCCTTGGTC C CAAGC CTGA 
ATCATAAAAT AGACCTTGTT GAACATCGTC GATATATAAA GGGGAACCAG GGTTCGGACT 

2165 2170 2175 2180 2185 2190 2195 2200 2205 2210 2215 2220 
****** 

TACTTTAGCC ATCATAACTC ACTAACAGGG AGAAGTAGCT AGTAGCAATG TGCCTTGATT 
ATGAAATCGG TAGTATTGAG TGATTGTCCC TCTTCATCGA TCATCGTTAC ACGGAACTAA 

2225 2230 2235 2240 2245 2250 2255 2260 2265 2270 2275 2280 

* * * * * * 

GATTAGATAA AGATTTCTAG TAGGCAGCAA AAGACCAAAT CTCAGTTGTT TGCTTCTTGC 
CTAATCTATT TCTAAAGATC ATCCGTCGTT TTCTGGTTTA GAGTCAACAA ACGAAGAACG 

2285 2290 2295 2300 2305 2310 2315 2320 2325 2330 2335 2340 
****** 

CATCACTGGT CCAGGTCTTC AGTTTCCGAA T C riCT ITCC C TICCCCTCTC GTCTATTGTC 
GTAGTGACCA GGTCCAGAAG TCAAAGGCTT AGAGAAAGGG AAGGGGACAC CAGATAACAG 
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2345 2350 2355 2360 2355 2270 2275 2380 2335 2390 2295 2400 

* » * * * * 

GCTATGTGAC TTGCGCTTAA TCCAATATTT TGCCTTTTTT CTATATCAAA AAACCTTTAC 
CGATACACTG AACGCGAATT AGGTTATAAA ACGGAAAAAA GATATAGTTT TTTGGAAA7G 

2405 2410 2415 2420 2425 2430 2435 2440 2445 2450 2455 2460 

* * * * * • 

AGTTAGCAGG G A T G TTCCTT ACCGAGGATT TTTAACCCCC AATCTCTCAT AATCGCTAGT 
TCAATCGTCC CTACAAGGAA 7GGCTCCTAA AAATTGGGGG TTAGAGAG7A TTAGCGATCA 

2465 2470 2475 2480 2485 2490 2495 2500 2505 2510 2515 2520 

* ***** 

GTTTAAAAGG CTAAGAATAG TGGGGCCCAA CCGATGTGGT AGGTGATAAA GAGGCATCTT 
CAAATTTTCC GATTCTTATC ACCCCGGGTT GGCTACACCA TCCACTATTT CTCCGTAGAA 

2525 2530 2535 2540 2545 2550 2555 2560 2565 2570 2575 2580 

* # » • * * 

TTCTAGAGAC ACATTGGACC AGATGAGGAT CCGAAACGGC AGCCITTACG TTGATCACCT 
AAGATCTCTG TGTAACCTGG TCTACTCCTA GGCTTTSCCG TCGGAAATGC AAGTAGTGGA 

2585 2590 2595 2500 2605 2510 2515 2520 2625 2630 2635 2640 
****** 

GCTAGAACCT CTCGTAGTCC ATCACCATTT CTTGGCATTG GAAT TCTACT G GAAAAAAAT 
CGATCTTGGA GAGCATCAGG TAGTGGTAAA GAACCGTAAC CTTAAGATGA CCTTTTTTTA 

2645 2650 2655 2660 2565 2670 2675 2680 2685 2690 2595 2700 

* * * * * * 

ACAAAAAGCA AAACAAAACC CTCAGCACTG TTACAAGAGG CCATTTAAGT ATCTTCTCCT 
TCTTTTTC GT TTTGTTTTGG GAGTCGTGAC AATGTTCTCC GGTAAATTCA TAGAACACGA 

2705 2710 2715 2720 2725 2730 2735 2740 2745 2750 2755 2760 
» ♦ * * * • 

TCTTCACTTA CCCATTAGCC AGGTTCTCAT TAGGTTTTGC TTGGGCCTCC CTGGCACTGA 
AGAAGTGAAT GGGTAATCGG TCCAAGAGTA ATCCAAAACG AACCCGGAGG GACCGTGACT 

2765 2770 2775 2780 2735 2790 2795 2300 2305 2810 2315 2820 
»♦**** 

ACCTTAGGCT TTGTATGACA GTGAAGCAGC ACTGTGAGTG GTTCAAGCAC ACTGGAATAT 
TGGAATCCGA AACATACTGT CACTTCGTCG TGACACTCAC CAAGTTCGTG 7GACCTTATA 

2825 2830 2835 2840 2345 2350 2355 2360 2865 2870 2375 2880 

* * * * * * 

AAAACAGTCA TGGCCTGAGA TGCAGGTGAT GCCATTACAG AACCAAATCG TGGCACGTAT 
TTITCTC AGT ACCGGACTCT ACGTCCACTA CGGTAATGTC TTGGTTTAGC ACCGTGCATA 

2885 2890 2395 2900 2905 2910 2915 2920 2925 2930 2935 2940 
****** 
TGCTGTGTCT CCTCTCAGAG TGACAGTCAT AAATACTGTC AAACAATAAA GG GAGAAT GG 
ACGACACAGA GGAGAGTCTC ACTGTCAGTA TTTATGACAG TTTGTTATTT CCCTCTTACC 

2945 2950 2955 2960 2965 2970 2975 2980 2985 2990 2995 3000 
****** 
TGCTGTTTAA AGTCACATCC CTGTAAATTG CAGAATTCAA AAGTGATTAT CTCTTTGATC 
ACGACAAATT TCAGTGTAGG GACATTTAAC GTCTTAAGTT TTCACTAATA GAGAAACTAG 

3005 3010 3015 3020 3025 3030 3035 3040 3045 3050 2055 3060 

* * * * * * 

TACTTGCCTC ATTTCCCTAT CTTCTC CC CC ACGGTATCCT AAACTTTAGA CTTCCCACTG 
ATCAACGGAG TAAAGGGATA GAAGAGGGGG TGCCATAGGA TTTGAAATCT GAAGGGTGAC 

3065 3070 3075 3080 3085 3090 3095 3100 3105 3110 3115 3120 
****** 
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TTCTCAAAGG AGACATTGCT CTATGTCTGC CTTCGACCAC AGCAAGCCAT CATCCTCCAT 
AAGACTTTCC TCTGTAACGA GATACAGACG GAAGCTGGTG TCGTTCGGTA G7AGGAGGTA 

3125 3130 3135 3140 2145 3150 3155 3160 3165 3170 2175 3180 

* * * * * * 
TGCTCCCGGG GACTCAAGAG GAATCT GT TT CTC T GCTGTC AACTTCCCA7 ~~X3GCTCAGC 
ACGAGGGCCC CTGAGTTCTC CTTAGACAAA GAGACGACAG TTGAAGGGTA GACCGAG7CG 

3185 3190 3195 3200 2205 3210 2215 3220 2225 3230 2235 ^240 

* * 

ATAGGGTCAC TTTGCCATTA TX3CAAATGGA GATAAAAGCA ATTCTGGCTG TCCAGGAGCT 
TATCCCAGTG AAACGGTAAT ACGTTTACCT CTATTTTCGT TAAGACCGAC AGGTCCTCGA 

3245 3250 2255 3260 2265 3270 3275 3280 3285 3290 2295 ^300 

AATC7GACCG TTCTATTGTG TGGATGACCA CATAAGAAGG CAATTTTAGT GTATTAATCA 
TTAGACTGGC AAGATAACAC ACCTACTGGT GTATTCTTCC GTTAAAATCA CATAATTAGT 

3305 3310 2215 3320 2325 2330 2235 3340 2345 3350 2255 2350 
****** 
T ^ TTATTA TAAACTATAA ACTTAAGGGC AAGGAGTTTA TTACAATGTA TCTTTATTAA 
ATCTAATAAT ATTTGATATT TGAATTCCCG TTCCTCAAAT AATGTTACAT AGAAATAATT 

3365 3370 3375 3380 3385 3390 2395 3400 3405 3410 "415 3420 
* 

AACAAAAG GG TGTATAGTGT TCACAAACTG 7GAAAATAGT GTAAGAACTG TACATTGTGA 
TTGTTTTCCC ACATATCACA AGTGTTTGAC ACTTTTATCA C AT TCT TC AC ATGTAACACT 

3425 3430 3435 3440 3445 3450 3455 3460 3465 3470 2475 3480 



rA TTTTTCTCTT GTACCATAGA AAAATGTATA AAAATTATCA AAAAGCTAAT 
CGAGACCAAT AAAAAGAGAA CATGGTATCT TTTTACATAT TTTTAATAGT TTTTCGAITA 

3485 3490 3495 3500 3505 3510 3515 3520 3525 3530 2535 3540 

* * * # * # 

GTCCAGGGAT ATTGCCTTAT TTCTCTGTAA AAAA7GGAGC TCAGTAACAT AA C T GC TTCT 
CACGTXTCw.A TAACGGAATA AACAGACATT ZTTVkCCTCZ AGTCATTGTA 7TGACGAAGA 

3545 3550 2555 3560 2565 3570 2575 

* * » 

TGGAGCTTTG GAATATTTTA TCCTGTATTC TTCTTT (SEQ ID NO: 9) 
ACCTCGAAAC CTTATAAAAT AGGACATAAG AACAAA 

FIG. 8F 
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10 20 30 40 50 60 

GGAAACGCAC CCTCCTGTAG GTGAAAATTC TCTTCACTAC CTGGCCACCT CGCCTGACTG 
CCTTTCCCTC GGAGGACATC CACTTTTAAG ACAAGTGATG GACCGGTGGA CCGGACTGAC 

70 30 90 100 HO 120 

ACCTTCACAG CTTCATCATC TTCCTGAAGA GGCATTCAGG ATTCCCTCCA TCCCTACCCC 
TGGAAGTGTC GAACTAGTAG AAGGACTTCT CCGTAAGTCC TAAGGGAGGT AGGOATCGGG 

130 140 150 160 170 180 

TTCTGGACAA AGTCTTCCAC GTTTCCTTCC TGGGAGTTTC TTCCAGGAAC TGGAGATACC 
AAGACCTCTT TCAGAAGGTG CAAAGCAAGG ACCCTCAAAG AAGGTCCTTG ACCTCTATGG 

190 200 210 220 230 240 

* * * * * • • • • » « « 

CAGAGCCCTG CAACTCCCAC TGGCCAACGA TGGGGGCAGC CGCTCACCAT CCTCAGAGAG 
GTCTCGGGAC GTTGAGGGTG ACCGGTTGCT ACCCCCGTCG GCGAGTGGTA GGAGTCTCTC 

2S0 260 270 280 290 





♦ 


• 




* 


* 


* 






* 


* 




# 




• 




CTCCCCACAG CACCCTACAC CCCCCACCCG GCCCCGCCAC 


ATG 


CTG 


GGG 


CTC 


CCA 


GAGGGGTGTC GTGGGATGTG GGGGG* 


rGGGC CGGGGCGGTG 


TAC 


GAC 


CCC 


GAG 


GGT 
























Met 


Leu 


Gly 


Leu 


Pro 


300 






310 






320 




330 






340 










• 


• 




• 




♦ 


• 




• 




* 


* 




TCA 


ACC 


TTG 


TTC 


ACA 


CCG 


CGC 


AGT 


ATG 


GAG 


AGC 


ATC 


GAG 


ATT 


GAC 


CAG 


AGT 


TGG 


AAC 


AAG 


TGT 


GGC 


GCG 


TCA 


TAC 


CTC 


TCG 


TAG 


CTC 


TAA 


CTG 


' GTC 


Ser 


Thr 


Leu 


Phe 


Thr 


Pro Arg 


Ser 


Met 


Glu 


Ser 


He 


Glu 


lie 


Asp 


Gln> 




350 




360 






370 






380 




390 


* 




* 


• 








• 


* 




* 




* 


• 




* 


AAG 


CTG 


CAG 


GAG 


ATC 


ATG 


AAG 


CAG 


ACA 


GGG 


TAC 


CTG 


ACT 


ATC 


GGG 


GGC 


T7C 


GAC 


GTC 




TAG 


TAC 




GTC 


TGT 


CCC 


ATG 


GAC 


TGA 


TAG 


CCC 


CCG 


Lys 


Leu 


Gin 


Glu 


He 


Met 


Lys 


Gin 


Thr 


oly 


Tyr 


Leu 


Thr 


Xla 


Gly Gly> 






400 






410 




420 






430 










* 






• 






• 




* 




* 


* 




* 




CAG 


CGT 


TAT 


CAG 


GCA 


GAA 


ATC 


AAT 


GAC 


TTG 


GAG 


AAC 


TTG 


GGT 


GAG 


ATG 


GTC 


GCA 


ATA 


GTC 


CGT 


CTT 


TAG 


TTA 


CTG 


AAC 


CTC 


TTG 


AAC 


CCA 


CTC 


TAC 


Gin 


Arg 


Tyr Gin 


Ala 


Glu 


Ha 


Asn 


Asp 


Leu 


Glu 


Agn- 


Leu 


Gly 


GlU 


Mao 


440 




450 






460 






470 




480 






• 






« 




• 


• 




* 




• 


• 




* 




• 


GGC 


AGT 


GGT 


ACC 


TGT 


GGT 


CAG 




TGG 


AAG 


ATG 


CGG 


TTC 


CGG 


AAG 


ACA 


CCG 


TCA 


CCA 


TGG 


ACA 


CCA 


GTC 


cxc 


ACC 




TAC 


CCC 


AAG 




TTC 


TGT 


Gly 


Ser 


Gly Thr 


Cys 


Gly 


Gin 


Val 


Trp 


Lys 


Mac 


Arg 


Phe 


Arg 


Lya 


Thr> 


490 






500 




510 






520 






530 




* 




* 




• 


• 




• 






# 




• 




* 


♦ 


GGC 


CAC 


ATC 


ATT 


GCT 




AAG 


CAA 


ATG 


CGG 


CGC 


TCT 


GGG 


AAC 


AAG 


GAA 


CCG 


GTG 


TAG 


TAA 


CCA 


CAA 




GTT 


TAC 


****** 


GCG 


AGA 


CCC 


****** 


TTC 


CTT 


Gly 


His 


He 


He 


Ala 


Val 


Lys Gin 


Met 


Arg 


Arg 


Ser 


Gly Asn 


Lys 


Glu 
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SUBSTITUTE SHEET (RULE 26) 



WO 99/02547 



PCTVUS98/14101 



MKK7 33/54 



540 






SSO 






560 




570 






530 




* . 




* 


• 






« 


* 


• 




* 


• 




GAG AAT 


AAG 




ATT 


TTG 


ATG 


GAC CTG 


GAT 


GTA GTA 


CTC 


AAG 




CAT 


CTC TTA 


TTC 


GCG 


TAA 


AAC 


TAG 


CTG GAC 


CTA 


CAT CAT 


GAG 




****** 
4WS1 


GTA 


Glu Asn 


Lys 


Arg 


lie 


Leu 


Met 


Asp Leu 


Asp 


Val Val 


Leu 




Ser 


His> 





590 




600 






610 






620 




630 


* 


* 


• 


* 




• 


• 




* 


• 


• 


• 


GAC 


TGC CCT 


TAC 


ATC GTT 


CAG 


AW- 


TTT 


GGC 


ACC 


TTC ATC 


ACC 


AAC ACA 


CTG 


ACG GGA 


ATG 


TAG CAA 


GTC 




AAA 


CCG 


TGG 


AAG TAG 


TGG 


TTG TGT 


Asp 


Cys Pro 


Tyr 


He Val 


Gin 


Cys 


Phe 


Gly 




Phe lie 


Thr 


Ass Thr> 



640 650 
* * • * * 

GAC GTC TTT ATT GCC ATG GAG CTC 
• CTG CAG AAA TAA CCG TAC CTC GAG 
Asp Val Phe He Ala Met Glu Leu 



660 670 

* # • * 

ATG GGC ACA TGT OCA GAG. AAG CTO 
TAC CCG TGT ACA CGT CTC TTC GAC 
Mac Gly Thr Cys Ala Glu Lys Leu> 



680 690 700 

« ♦ ♦ * * 

AAG AAA CGA ATG CAG GGC CCC ATT 

TTC TTT GCT TAC GTC CCG GGG TAA 

Lys Lys .Arg Wee Gin Gly Pro He 



710 720 
♦ » * » + 

CCA GAG CGA ATC CTG GGC AAG ATG 
GGT CTC GCT TAG GAC CCG TTC TAC 
Pro Glu Arg lie Leu Gly Lys Mao 



730 740 750 

* « • • #• 

ACT GTG GCG ATT GTC AAA GCA CTG 
TGA CAC CGC TAA CAC TTT CGT GAC 
Thr Val Ala He Val Lys Ala Leu 



760 770 
* * * # » 

TAC TAT CTG AAG GAG AAG CAT GGC 
ATG ATA GAC TTC CTC TTC GTA CCG 
Tyr Tyr Leu Lys Glu Lys His Gly> 



730 790 B00 310 320 

• # * • * « 

GTC ATC CAT CGC GAT GTC AAA CCC TCC AAC ATC CTG CTA GAT GAG CGG 

CAG TAG GTA GCG CTA CAG TTT GGG AGG TTG TAG GAC GAT CTA CTC GCC 

Val He Kis Arg Asp Val Lys Pro Ser Asn He Leu Leu Asp Glu Arg> 



830 

GGC CAG ATC AAG 
CCG GTC TAG TTC 
Gly Gin He Lys 



340 

CTC TGT GAC TTT 
GAG ACA CTG AAA 
Lau Cys Asp Phe 



850 3 

GGC ATC AGT GGC 
CCG TAG TCA CCG 
Gly He Ser Gly 



10 870 

CGC CTT GTT GAC 
GCG GAA CAA CTG 
Arg Leu Val. Asp> 



830 890 900 910 

* # * ♦ * * * ^ # * 

TCC AAA GCC AAA ACA CGG AGT GCT GGC TGT GCT GCC TAT ATG GCT CCC 
AGG TTT CGG TTT TGT GCC TCA CGA CCG ACA CGA CGG ATA TAC CGA GGG 
Ser Lys Ala Lys Thr Arg Ser Ala Gly Cys Ala Ala Tyr Met Ala Pro> 



920 930 940 

* * # • • 

GAG CGC ATC GAC CCT CCA GAT CCC 

CTC GCG TAG CTG GGA GGT CTA GGG 
Glu Arg He Asp Pro Pro Asp Pro 



950 960 
• * * * # 

ACC AAG CCT GAC TAT GAC ATC CGA 
TGG TTG GGA CTG ATA CTG TAG GCT 
Thr Lys Pro Asp Tyr Asp He Arg> 

FIG. 10B 



SUBSTITUTE SHEET (RULE 26) 



WO 99/02547 



PCTYUS98/14101 



MKK7 

970 980 
• • • 

GCT GAT GTG TGG AGC 
CGA CTA CAC ACC TCG 
Ala Asp Val Trp Ser 



34/54 
990 1000 

* 

CTG GGC ATC TCA CTG GTG 
GAC CCS TAG AGT GAC CAC 
Leu Gly H e S«r Leu Val 



1010 
• * ■ 

GAG CTG CCA ACA CGA 
CTC GAC CGT TGT CCT 
Glu Leu Ala Thr Gly> 



1020 1030 1040 1050 1060 

CAG TTC CCC TAT AAG AAC TCC AAG ACG GAC TTT GAG GTC CTC ACC AAA 
GTC AAG GGG ATA TTC TTG ACG TTC TGC CTG AAA CTC CAG GAG TGG T"^ 
Gin Phe Pro Tyr Lys Asn Cys Lys Thr Asp Phe Glu Val Lau Thr Lys> 



1070 1080 logo HOC mo 



* 



* 



GTC CTA CAG GAA GAG CCC CCA CTC CTG CCT GGT CAC ATG GGC TTC TCA 
CAG GAT GTC CCT CTC GGG GGT GAG GAC GGA CCA GTG TAC CCG AAG AGT 
Val Lau Gin Glu Glu Pro . Pro Leu .Leu Pro Gly His Met Gly Phe S*r> 



1120 1130 H40 1150 

* * * • * * • • « 

GGG GAC TTC CAG TCA TTT GTC AAA GAC TGC CTT ACT AAA GAT C^AC AGG 
CCC CTG AAG GTC AGT AAA CAG TTT CTG ACG GAA TGA TTT CTA GTG TCC 
Gly Asp Pha Gin Ser Phe Val Lys Asp Cys Leu Thr Lys Asp His Arg> 



1150 1170 1180 

# • * * * 

AAG AGA CCA AAG TAT AAT AAG CTA 
TTC TCT GGT TTC ATA TTA TTC GAT 
Lys Arg Pro Lys Tyr Asn Lys Lau 



1130 1200 
* * # • * « 

CTT GAA CAC AGC TTC ATC AAG CAC 
GAA CTT GTG TCQ AAG TAG TTC GTG 
Lau Glu His Sar Phe 11a Lys His> 



1210 
* 




• 


1220 
* 


• 


1230 

* 


* 


1240 




12S0 




TAT 
ATA 
Tyr 


CTC 
Glu 


ATA 
TAT 
Xla 


w * w 

GAG CTC 
Lau Glu 


'GTG 
CAC 
Val 


GAT GTC GCG 
CTA CAG CGC 
Asp Val Ala 


TCC 
AGG 
Ser 


* 

TGG 
ACC 
Trp 


TTT AAG 
AAA TTC 
Phe Lys 


• 

GAT GTC 
CTA CAG 
Asp Val 


ATG 
TAC 
Mat> 


1260 
* 




1270 
• * 




1280 
* • 




1290 


1300 




OCO 
CGC 
Ala 


AAG 
Lys 


ACC 
TGG 
Thr 


GAG TCC 
CTC AGG 
Glu Ser 


CCA 
GGT 
Pro 


AGG ACT AGT 
TCC TGA TCA 
Arg Thr Ser 


GGA 
CCT 
Gly 


GTC 
CAG 
Val 


CTG AGT 
GAC TCA 
Lau Ser 


• • 

CAG CAC 
GTC GTG 
Gin His 


CAT 
GTA 
His> 



1310 1320 1330 1340 13S0 136Q 

• * * • • • • * 

CTG CCC TTC TTC ACG TA GCCTCATGGC AGCGGCCAGC CCCCCAGGGG CCCCGGGCCA 

GAC GGG AAG AAG TCC AT CGGAGTACCG TCGCCGGTCG GGGCGTCCCC GGGGCCCGGT 
Lau Pro Pha Phe Arg> 



1370 1380 1390 1400 1410 1420 

• • • • • # * • ^ 
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35/54 

MKK7 

1430 1440 14S0 1460 1470 1480 



7GGACGAC7G CCAAGGAC73 AGGACAGAAA G7AGGGGG77 CCCATCCAGC 7C7GACTCCC 
ACCTGCTGAC GGTTCC7GAC 7CC7G7C77T CA7CCCCCAA GGGTAGGTCG AGACTGAGGG 

1430 1500 1510 1520 1530 1540 

* • • • * ## * * * • 

TGCCTACCAG CTG7GGACAA AAGGGCA7GC 7GG777CTAA 7CCC7CCCAC 7CTGGGG7CA 
ACG0A7GG7C GACACC7G77 77CCCG7ACG ACCAAGGA77 AGGGAGGGTG AGACCCCAGT 

1550 1550 1570 1580 1590 1S00 

* ♦ • * • * * • • * # * 

GCCAGCAG7G 7GAGCCCCAT CCCACCCCGA CAGACACTGT GAACGGAAGA CAGCAGGCCA 

CGG7CG7CAC ACTCGGGGTA GGG7GGGGC7 G7C7G7GACA C7TSCCTTC7 G7CGTCCGG7 

1610 1620 

* ♦ • • 

AAAAAAAAAA AAAAAAAAAA AAA (SEQ ID NO: 17) 
TT T TTTTTTT TTTTTTTTTT 777 



FIG. 10D 



SUBSTITUTE SHEET (RULE 26) 



WO 99/02547 



PCT/US98/14101 



MKK7b 36/54 
Sequence Range: 1 to 1465 

• ■ . 3 ; . « . » 

CC ACG ACC CCT GCT CCT CCC CCC TCC CAG CGA GCA GCC CTC CAA CT~ CCA 
CG TGC TCC GGA CGA CGA CGG CGC AGG GTC GCT CGT CGG GAC GTT GAG GOT 
Thr Ser Pro A^a Pro Ala Pro Ser Gin Arg Ala Ala Leu Gin Leu P^e> 



50 70 
* * * * t 

CTG GCC AAC GAT GGG GGC AGC CGC T: 
GAC CGG TTG CTA CCC CCG TCG GCG AC 
Leu Ala Asa Asp Gly Gly Ser Arg Si 



80 90 
■ • * • # 

^ CCA TCC TCA GAG AGC TCC CCA 
>7 GGT AGO AGT CTC TCG AGG GGT 
ir Pro Ser Ser Glu Ser Ser Pro> 



100 110 120 

* ♦ • • « 

CAG CAC CCT ACA CCC CCC ACC CGG 
GTC GTG GGA TGT GGG GGG TGG GCC 
Gin, His Pro Thr Pro Pro Thr Arg 



130 140 

CCC CGC CAC ATG CTG GGG CTC CCA 
GGG GCG GTG TAG GAC CCC GAG GGT 
Pro Arg Kis Met Leu Gly Leu Pro> 



ISO 150 170 

» * ♦ * * 

TCA ACC TTG TTC ACA CCG CGC AGT 
AGT TGG AAC AAG TGT GGC GCG TCA 
Ser Thr Leu Phe Thr Pro Arg s«r 



130 iso 
* * * * 

ATG GAG AGC ATC GAG ATT GAC CAG 
TAC CTC TCG TAG CTC TAA CTG GTC 
Man Glu Ser lie Glu He Asp Gln> 



200 
* 



210 



220 



230 



240 



JJS £25 22° *J£ *™ J£2 ^ *=* 030 TAC «° ACT ATC GOG GGC 

TTC GAC. GTC CTC TAG TAC TTC GTC TGT CCC ATG GAC TGA TAG CCC CCG 
Ly» Lau Gla Glu XI* Mec t.y. Gin Thr Gly Tyr Sr lit G^> 



2S0 2S0 

• « * • w 

CAG CGT TAT CAG GCA GAA ATC AAT 
GTC GCA ATA GTC CGT CTT TAG TTA 
Gin Arg Tyr -Gla Ala Glu He Asa 



270 280 290 

• * * * • 

GAC TTG GAG AAC TTG GGT GAG ATG 
CTG AAC CTC TTG AAC CCA CTC TAC 
Asp Leu Glu Asa Leu Gly Glu Mtt> 



300 310 
* * * * 

GGC AGT GGT ACC TGT GGT CAG GTG 
CCG TCA CCA TGG ACA CCA GTC CAC 
Gly Ser Gly Thr Cys Gly Gin Val 



320 330 
* * • » * 

TGG AAG ATG CGG TTC CGG AAG ACA 
ACC TTC TAC GCC AAG GCC TTC TGT 
Trp Lya Wet Arg Phe Arg .Lys Thr> 



340 3S0 350 

• • « * # 

GGC CAC ATC ATT CCT GTT AAG CAA 
CCG GTG TAG TAA CGA CAA TTC GTT 
Gly Kis lie He Ala Val Lya Gla 



370 380 
» » * * # 

ATG CGG CGC TCT GGG AAC AAG GAA 
TAC GCC GCG AGA CCC TTG TTC CTT 
Met Arg Arg Ser Gly Asa Lys Glu> 



390 400 4io 

* # * * * 

GAG AAT AAG CGC ATT TTG ATG GAC 

CTC TTA TTC GCG TAA AAC TAC CTG 
Glu Asa Lys Arg lie Lau Met Aap 



420 430 
* * • * 

CTG GAT GTA GTA CTC AAG AGC CAT 
GAC CTA CAT CAT GAG TTC TCG GTA 
Leu Asp Val Val Leu Lys Ser His> 

FIG. 11A 
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37/54 

MKK7b 

440 450 45 0 



470 4 80 



gac tgc ccr tag Arc err cag tgc ttt ggc xcc vrc • * 

CTO ACC GGA ATG TAG CAA GTC ACG AAA C~ feS AAG tIg TGG £c TST 
Asp Cy. Pro Tyr II. Val 61a Cys Ph. Gly Thr Phe lie HI t?J> 

. "J . 5 °? . 510 520 530 

* * * • • 

GAC GTC TTT ATT GCC ATG GAG CTC ATG GGC ACA TGT OCX e«- 

CTO CAG AAA TAA CCC TAG CTC GAG TAC £c1 TgJ IS S£ ~~ 5? 

X,p Val Phe II, Ala Met Glu t au „.t Gly Thr CyV SJ Su l£ 2S» 



540 ^ 550 sso 57Q 

^ «J *~ £2 2S 2 ^ CGA ATC CTG GGC AAG ATG 

..«. -GCT -AC G._ GGG TAA GGT CTC GCT TAG GAC CC5 TT<- TAC 

by. ty» Arg Kett Gin Gly Pro II, p„ olu ^ - lle Leu *J £J ^ 



580 590 600 



610 620 



ACT GTG GCG ATT GTG AAA GCA CTG TAC TAT CTG AAG GAG AAG CAT GCC 
TGA CAC CGC TAA CAC TTT CGT GAC ATG' ATA GAC TTC CTC cti rS* 
Thr Val Ala II. Val Lys Ala Leu Tyr Tyr* 2£ ™ S £ 2 §Iy> 



630 640 sso 



660 670 



GTC ATC CAT CGC GAT GTC AAA CCC TCC AAC ATC CTQ CTA r.i-i nr m. 
CAG TAG OTA GCG CTA CAG TTT GGG AGG TTG TAG GAC GAT CTA CTC GCC 
Val lie Hi. Arg A.p Val Ly. Pro s« r A , n lla 2p gS £g> 



680 690 7Q0 



710 720 



GGC CAG ATC AAG CTC TGT GAC TTT. GGC ATC ACT GCC CGC CTT GTT CAC 
CCG GTC TAG TTC GAG ACA CTG AAA CCG TAG TCA CCG GCG GAA CAA CTG 
Gly Gl* lie Lys Leu Cy. Asp Phe Gly lla s« Gly Arg Su Vat 



730 740 7so 



* 



760 770 



I" *£ J* ^ 522 *2? GCT ^ TGT GCC TAT ATG GCT CCC 

AGG TTT CGG TTT TGT GCC TCA' CGA CCG ACA CGA COG ATA TAC COA GGG 
S.r Ly, Ala Ly, Thr Arg Ser Ala Gly Cy, Ala xil Tyr Ket 5a* S> 



730 790 



300 B10 



GAG CGC ATC GAC CCT CCA GAT CCC ACC AAG CCT GAC TAT GAC ATC CGA 
CTC GCG TAG CTG GGA GGT CTA GGG TGG TTC GGA CTG ATA CTG TAG GCT 
Glu Arg II. Asp Pro Pro A.p p ro Thr Lye Pro Aap Tyr Asp lie SJ> 

FIG. 11B 



SUBSTITUTE SHEET (RULE 26) 



WO 99/02547 



PCT/US98/14101 



MKK7b 38/54 

820 330 340 850 860 



GCT GAT GTG TGG AGC CTG GGC ATC TCA CTG GTG GAG CTG GCA ACA GGA 
CGA CTA CAC ACC TCG GAC CCG TAG AGT GAC CAC CTC GAC CGT TGT CCT 
Tr ~ Ser Giy He Ser Leu Val Glu Leu Ala Thr Gly> 



Ala Aap Val 



870 880 890 900 



9X0 



CAG TTC CCC TAT AAG AAC TGC AAG ACG GAC TTT GAG GTC CTC ACC AAA 
GTC AAG GGG ATA TTC TTG ACG TTC TGC CTG AAA CTC CAG GAG TGG TTT 
Gla Phe Pro Tyr Lys Asn Cys Lys Thr Asp Pha Glu Val Leu Thr Lya> 



920 330 940 950 960 

GTC CTA CAG GAA GAG CCC CCA CTC CTG CCT GGT CAC ATG GGC TTC TCA 
CAG GAT GTC CTT CTC GGG GGT GAG GAC GGA CCA GTG TAC CCG AAG AGT 
Val Leu Gin Glu Glu Pre Pro Leu Leu Pro Gly His Mat Gly Pha Sar> 



970 980 
• ♦ * « « 

GGG GAC TTC CAG TCA TTT GTC AAA 
CCC CTG AAG GTC AGT AAA CAG TTT 
Gly Asp Pha Gin Ser Pha Val Lys 



990 1000 1010 

» * # * * 

GAC TGC CTT ACT AAA GAT CAC AGG 

CTG ACG GAA TGA TTT CTA GTG TCC 

Aap Cys Leu Thr Lya Aap Kia Arg> 



1020 1030 1040 1050 

• ♦ ♦ 

AAG AGA CCA AAG TAT AAT AAG CTA CTT GAA CAC AGC TTC ATC AAG CAC 
TTC TCT GGT TTC ATA TTA TTC GAT GAA CTT GTG TCG AAG TAG TTC GTG 
Lya Arg Pro Lya Tyr Asn Lya Leu Leu Glu Hia Ser Phe He Lya Hia> 



1060 1070 
• ♦ ♦ 

TAT GAG ATA CTC 
ATA CTC TAT GAG 
Tyr Glu Ha. Lau 



1080 
* • 

GAG GTG GAT GTC 
CTC CAC CTA CAG 
Glu Val Asp Val 



1030 

GCG TCC TGG TTT 
CGC AGG ACC AAA 
Ala Ser Trp Pha 



1100 
• * * 

AAG GAT GTC ATG 
TTC CTA CAG TAC 
Lya Aap Val Mao 



1110 1120 U30 

* * » • « 

GCG AAG ACC GAG TCC CCA AGG ACT 
CGC TTC TGG CTC AGG GGT TCC TGA 
Ala Ly8 Thr Glu Ser Pro Arg Thr 



1140 H50 
♦ * ★ * 

AGT GGA GTC CTG AGT CAG CAC CAT 
TCA CCT CAG GAC TCA GTC GTG GTA 
Ser Gly Val Leu Sar Gin Hia Hia> 



1160 1170 1180 H90 1200 1210 

CTG CCC TTC TTC AGG T AGCCTCATGG CAGCGGCCAG CCCCGCAGGG GCCCCGGGCC 

OAC GGG AAG AAG TCC A TOGW^C GTCGCCGGTC GGCCCGTCCC CGGGGCCCGG 
Lau Pro Phe Phe Arg> (SEQ ID NO: 20) 



1220 1230 1240 

* * * • • * 

ACGGCCACCG ACCCCCCCCC CAACCTGGCC 
TGCCGGTGGC TGGGGGGGGG GTTGGACCGG 

1280 1290 1300 

* * * * * * 

CTGGACGACT GCCAAGGACT OAGGACAGAA 

GACCTGCTGA CGGTTCCTGA CTCCTGTCTT 



1250 1260 1270 

• ♦ * * + * 

AACCCAGCTG CCCATCAGGO GACCTGGGAC 
TTGGGTCGAC GGGTAGTCCC CTGOACCCTG 

1310 1320 1330 

• * * # • « 

AGTAGGGGGT TCCCATCCAG CTCTGACTCC 

TCATCCCCCA AGGGTAGGTC GAGACTGAGG 

FIG. 11C 
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1340 1350 1360 1370 1380 1390 

♦ * * ♦ • • *« * * 

CTGCCTACCA CCTGTGGACA AAAGGGCATG C7SGTTCCTA ATCCCTCCCA CTCTGGGGTC 
GACGGATGGT CGACACC7GT TTTCCCGTAC GACCAAGGAT TAGGGAGGGT GAGACCCCAG 

1400 1410 1420 1430 1440 1450 

AGCCAGCAGT GTGAGCCCCA TCCCACCCCG ACAGACACTG TGAACGGAAG AGAGCAAAAA 
TCGGTCGTCA CACTCGGGGT AGGGTGGGGC TGTCTGTGAC AC7TGCCTTC 7GTCG77TT? 

1460 

• » • 

AAAAAAAAAA AAAAA (SEQ ID NO: 19) 

TTTTrrrrr r ttttt 



FIG. 11D 
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Human MKK7 40/54 
Sequence Range: 1 to 843 

. X ! . 2 °. . 3 ? . 4 ? .5? 60 

TGTTTGTCTG CCGGACTGAC GGGCGGCCGG GCGGTGCGCG GCGGT»-rr- * 
ACAAACAGAC GGCCTGACTG CCCGCCGGCC ^SS^ g£3£ 

# ™ # 80 # 90 10 o 

G ATG GCG GCG TCC TCC CTG GAA CAG AAG CTG TCC CGC CTG Gix nrz *^ 

c Z"; c 00 AGG GAC CTT GTC TTC GAc S S 

Met Ala Ala Ser Ser Leu Glu Gin Lys Leu Ser Arg Leu g£ u S £s> 



110 120 130 

* * * * * 

CTG AAG CAG GAG AAC CGG GAG GCC 
GAC TTC GTC CTC TTG GCC CTC CGG 
Leu Lys Gin Glu Asn Arg Glu Ala 



140 ISO 

CGG CGG AGG ATC GAC CTC AAC CTG 
GCC GCC TCC TAG CTG GAG TTG GAC 
Arg Arg Arg He Asp Leu Asn Leu> 



160 170 180 

* * * * ♦ 

GAT ATC AGC CCC CAG CGG CCC AGG 
CTA TAG TCG GGG GTC GCC GGG TCC 
Asp He Ser Pro Gin Arg Pro Arg 



190 200 

CCC ACC CTG CAG CTC CCG CTG GCC 
GGG TGG GAC GTC GAG GGC GAC CGG 
Pro Thr Leu Gin Leu Pro Leu Ala> 



210 220 230 



240 250 



AAC GAT GGG GGC AGC CGC TCG CCA TCC TCA GAG AGC TCC CCG CAG CAC 
TTC CTA CCC CCG TCG GCG AGC GGT AGG AGT CTC TCG AGG GGC GTC GTG 
Asn Asp Gly Gly Ser Arg Ser Pro Ser Ser Glu Ser Ser S £n SsV 



260 270 
***** 

CCC ACG CCC CCC GCC CGG CCC CGC 
GGG TGC GGG GGG CGG GCC GGG GCG 
Pro Thr Pro Pro Ala Arg Pro Arg 



280 290 300 

CAC ATG CTG GGG CTC CCG TCA ACC 
GTG TAC GAC CCC GAG GGC AGT TGG 
His Met Leu Gly Leu Pro Ser Thr> 



310 320 
* * * * # 

CTG TTC ACA CCC CGC AGC ATG GAG 
GAC AAG TGT GGG GCG TCG TAC CTC 
Leu Phe Thr Pro Arg Ser Met Glu 



330 3 4 o 

AGC ATT GAG ATT GAC CAG AAG CTG 
TCG TAA CTC TAA CTG GTC TTC GAC 
Ser He Glu He Asp Gin Lys Leu> 



350 360 370 

* * * * + 

CAG GAG ATC ATG AAG CAG ACG GGC 
GTC CTC TAG TAC TTC GTC TGC CCG 
Gin Glu He Met Lys Gin Thr Gly 



380 390 

TAC CTG ACC ATC GGG GGC CAG CGC 
ATG GAC TGG TAG CCC CCG GTC GCG 
Tyr Leu Thr He Gly Gly Gin Arg> 



400 410 «0 430 a*« 

* * * * # # ^ 440 

TAC CAG GCA GAA ATC AAC GAC CTG GAG jur TVr , 

ATC GTC CGT CTT TAG TTG CTG Ga2 SS JJS 22 S SS £2 S £ 

Tyr Gin Ala Glu He Asn Asp Leu Glu Asn Leu Gly Su Met £y lll> 

FIG. 12A 



SUBSTITUTE SHEET (RULE 26) 



WO 99/02547 



PCT/US98/14101 



Human MKK7 41/54 
450 460 47Q 



480 490 



555 C TGC GGC CAG GTG TGG AAG ATG CGC TTC CGG AAG ACC GGC CAC 
CCG TGG ACG CCG GTC CAC ACC TTC TAC GCG AAG GCC TTC TGG C~G GTG 
Gly Thr Cys Gly Gin Val Trp Lys Met Arg Phe Arg Lys S2 Gly His> 



500 510 
* * • * • 

GTC ATT GCC GTT AAG CAA ATG CGG 
CAG TAA CGG CAA TTC GTT TAC GCC 
Val lie Ala Val Lys Gin Met Arg 



520 530 540 

CGC TCC GGG AAC AAG GAG GAG AAC 
GCG AGG CCC TTG TTC CTC CTC TTG 
Arg Ser Gly Asn Lys Glu Glu Asn> 



550 560 

* * * * ★ 

AAG CGC ATC CTC ATG GAC CTG GAT 

TTC GCG TAG GAG TAC CTG GAC CTA 

Lys Arg lie Leu Met Asp Leu Asp 



570 580 

* * * • 

GTG GTG CTG AAG AGC CAC GAC TGC 
CAC CAC GAC TTC TCG GTG CTG ACG 
Val Val Leu Lys Ser His Asp Cys> 



590 600 
* * * 

CCC TAC ATC GTG 
GGG ATG TAG CAC 
Pro Tyr He Val 



610 
* * 

CAG TGC TTT GGG 
GTC ACG AAA CCC 
Gin Cys Phe Gly 



620 

* * * 

ACG TTC ATC ACC 
TGC AAG TAG TGG 
Thr Phe lie Thr 



630 

* * 

AAC ACG GAC GTC 
TTG TGC CTG CAG 
Asn Thr Asp Val> 



640 650 660 

* * * * * 

TTC ATC GCC ATG GAG CTC ATG GGC 
AAG TAG CGG TAC CTC GAG TAC CCG 
Phe lie Ala Met Glu Leu Met Gly 



670 680 
* * * * # 

ACC TGC GCT GAG AAG CTC AAG AAG 
TGG ACG CGA CTC TTC GAG TTC TTC 
Thr Cys Ala Glu Lys Leu Lys Lys> 



690 700 7 

* * * * 

CGG ATG CAG GGC CCC ATC CCC GAG 
GCC TAC GTC CCG GGG TAG GGG CTC 
Arg Met Gin Gly Pro He Pro Glu 



-0 720 730 

CGC ATT CTG GGC AAG ATG ACA GTG 
GCG TAA GAC CCG TTC TAC TGT CAC 
Arg He Leu Gly Lys Met Thr Val> 



740 750 
***** 

GCG ATT GTG AAG GCG CTG TAC TAC 
CGC TAA CAC TTC CGC GAC ATG ATG 
Ala He Val Lys Ala Leu Tyr Tyr 



760 770 780 

***** 

CTG AAG GAG AAG CAC GGT GTC ATC 
GAC TTC CTC TTC GTG CCA CAG TAG 
Leu Lys Glu Lys His Gly Val Ile> 



790 800 
* * * * # 

CAC CGC GAC GTC AAG CCC TCC AAC 
GTG GCG CTG CAG TTC GGG AGG TTG 
His Arg Asp Val Lys Pro Ser Asn 



810 820 

* » * * 

ATC CTG CTG GAC GAG CGG GGC CAG 
TAG GAC GAC CTG CTC GCC CCG GTC 
He Leu Leu Asp Glu Arg Gly Gln> 



830 



840 



ATC AAG CTC TGC GA (SEQ ID NO: 25) 

TAG TTC GAG ACG CT 

He Lys Leu Cys> (SEQ ID NO: 26) 



FIG. 12B 



SUBSTITUTE SHEET (RULE 26) 



WO 99/02547 



PCT/US98/14101 



Mouse MKK7c 42/54 
Sequence Range: 1 to 1643 

10 20 30 40 50 SO 

AGCGCAGGCG CAGTGCGGTG TTTGTCTACC CCGGACTGAC GGGTGGCCTG GCGGTGAGCG 
TCGCGTCCGC GTCACGCCAC AAACAGATGG GGCC7GACTG CCCACCGGAC ^SSSc 



70 80 90 100 



110 

* 



GCGGCAGCGG CGGCGGGGAA G ATG GCG GCG TCC TCC CTG GAG CAG AAG C~G 
CGCCGTCGCC GCCGCCCCTT C TAC CGC CGC AGG AGG GAC CTC GTC ^C G^C 

Met Ala Ala Ser Ser Leu Glu Gin Lys Leu> 



120 

TCC CGC CTG GAA 
AGG GCG GAC CTT 
Ser Arg Leu Glu 



130 

* * * 

GCC AAG CTG AAG 
CGG TTC GAC TTC 
Ala Lys Leu Lys 



140 

CAG GAG AAC CGT 
GTC CTC TTG GCA 
Gin Glu Asn Arg 



150 

* * 

GAG GCC CGC AGG 
CTC CGG GCG TCC 
Glu Ala Arg Arg> 



160 170 130 

* * * * * 

AGG ATC GAC CTC AAC TTG GAT ATC 
TCC TAG CTG GAG TTG AAC CTA TAG 
Arg lie Asp Leu Asn Leu Asp He 



190 200 

AGC CCA CAG CGG CCC AGG CCC ACC 
TCG GGT GTC GCC GGG TCC GGG TGG 
Ser Pro Gin Arg Pro Arg Pro Thr> 



210 220 230 

***** 

CTG GAA CTC CCA CTG GCC AAC GAT 
GAC GTT GAG GGT GAC CGG TTG CTA 
Leu Gin Leu Pro Leu Ala Asn Asp 



240 250 

* 

GGG GGC AGC CGC TCA CCA TCC TCA 
CCC CCG TCG GCG AGT GGT AGG AGT 
Gly Gly Ser Arg Ser Pro Ser Ser> 



260 270 280 290 



300 



GAG AGC TCC CCA CAG CAC CCT ACA CCC CCC ACC CGG CCC CGC CAC ATG 
CTC TCG AGG GGT GTC GTG GGA TGT GGG GGG TGG GCC GGG GCG GTG TAC 
Glu ser Ser Pro Gin His Pro Thr Pro Pro Thr Arg Pro Arg h2 Met> 



310 320 
***** 

CTG GGG CTC CCA TCA ACC TTG TTC 
GAC CCC GAG GGT AGT TGG AAC AAG 
Leu Gly Leu Pro Ser Thr Leu Phe 



330 340 35 0 

* * * 

ACA CCG CGC AGT ATG GAG AGC ATC 
TGT GGC GCG TCA TAC CTC TCG TAG 
Thr Pro Arg Ser Met Glu Ser Ile> 



360 
* * 

GAG ATT GAC CAG 
CTC TAA CTG GTC 
Glu He Asp Gin 



370 

* * * 

AAG CTG CAG GAG 
TTC GAC GTC CTC 
Lys Leu Gin Glu 



380 

* * 

ATC ATG AAG CAG 
TAG TAC TTC GTC 
lie Met Lys Gin 



390 

* * 

ACA GGG TAC CTG 
TGT CCC ATG GAC 
Thr Gly Tyr Leu> 



400 410 420 

* * * * + 

ACT ATC GGG GGC CAG CGT TAT CAG 
TGA TAG CCC CCG GTC GCA ATA GTC 
Thr He Gly Gly Gin Arg Tyr Gin 



430 440 

GCA GAA ATC AAT GAC TTG GAG AAC 
CGT CTT TAG TTA CTG AAC CTC TTG 
Ala Glu He Asn Asp Leu Glu Asn> 

FIG. 13A 



SUBSTITUTE SHEET (RULE 26) 



WO 99/02547 



PCT/US98/14101 



Mouse MKK7c 43/54 

4S * * 4 " * * "2 . 49 ? 

TTC GGT GAG ATG GGC AGT GOT ACC TGT GGT CAG GTG TGG AAG ATG CG^ 
AAC CCA CTC TAG CCG TCA CCA TGG ACA CCA GTC CAC ACC He S 

Leu Gly Glu Met Gly Ser Gly Thr Cys Gly Gin 2? J£ Z* Met £g> 



500 510 520 



530 540 



TTC CGG AAG ACA GGC CAC ATC ATT GCT GTT AAG CAA ATG CGG rr^ -~r* 

AAG GCC TTC TGT CCG GTG TAG TAA CGA CAA TTC GTT TAC cS GCG 

Phe Arg Lys Thr Gly His He He Ala 5at £s gS 2S £g £g s2> 



550 560 
* * * 

GGG AAC AAG GAA GAG AAT AAG CGC 
CCC TTG TTC CTT CTC TTA TTC GCG 
Gly Asn Lys Glu Glu Asn Lys Arg 



570 580 590 

***** 

ATT TTG ATG GAC CTG GAT GTA GTA 
TAA AAC TAC CTG GAC CTA CAT CAT 
He Leu Met Asp Leu Asp Val Val> 



600 610 

CTC AAG AGC CAT GAC TGC CCT TAC 
GAG TTC TCG GTA CTG ACG GGA ATG 
Leu Lys Ser His Asp Cys Pro Tyr 



620 630 

* * * * 

ATC GTT CAG TGC TTT GGC ACC TTC 
TAG CAA GTC ACG AAA CCG TGG AAG 
He Val Gin Cys Phe Gly Thr Phe> 



640 650 660 

* * * * * 

ATC ACC AAC ACA GAC GTC TTT ATT 
TAG TGG TTG TGT CTG CAG AAA TAA 
He Thr Asn Thr Asp Val Phe He 



670 680 

* * * * 

GCC ATG GAG CTC ATG GGC ACA TGT 
CGG TAC CTC GAG TAC CCG TGT ACA 
Ala Met Glu Leu Met Gly Thr Cys> 



690 700 710 

***** 

GCA GAG AAG CTG AAG AAA CGA ATG 
CGT CTC TTC GAC TTC TTT GCT TAC 
Ala Glu Lys Leu Lys Lys Arg Met 



720 730 
* * * * * 

CAG GGC CCC ATT CCA GAG CGA ATC 
GTC CCG GGG TAA GGT CTC GCT TAG 
Gin Gly Pro He Pro Glu Arg Ile> 



740 750 76o 



770 780 



ET? 555 *** ATG ACT GTG ATT GTG AAA GCA CTG TAC TAT CTG AAG 

GAC CCG TTC TAC TGA CAC CGC TAA CAC TTT CGT GAC ATG SI gS £S 
Leu Gly Lys Met Thr Val Ala He Val Lys Ala Leu J£ 25 



790 800 
***** 

GAG AAG CAT GGC GTC ATC CAT CGC 
CTC TTC GTA CCG CAG TAG GTA GCG 
Glu Lys His Gly Val He His Arg 



8 « 820 830 

* 

GAT GTC AAA CCC TCC AAC ATC CTG 
CTA CAG TTT GGG AGG TTG TAG GAC 
Asp Val Lys Pro Ser Asn He Leu> 



840 850 

* * * * 

CTA GAT GAG CGG GGC CAG ATC AAG 
GAT CTA CTC GCC CCG GTC TAG TTC 
Leu Asp Glu Arg Gly Gin He Lys 



860 870 

* * * * 

CTC TGT GAC TTT GGC ATC AGT GGC 
GAG ACA CTG AAA CCG TAG TCA CCG 
Leu Cys Asp Phe Gly He Ser Gly> 

FIG. 13B 



SUBSTITUTE SHEET (RULE 26) 



WO 99/02547 



PCT/US98/14101 



44/54 

Mouse MKK7c 
830 890 900 



910 920 



AT, v.l Asp Ser x. ys „. Lys ^ % g= J£ S SL 



^30 940 950 

* 

TAT ATG GCT CCC GAG CGC ATC GAC 
ATA TAC CGA GGG CTC GCG TAG CTG 
Tyr Met Ala Pro Glu Arg He Asp 



960 970 

* 

CCT CCA GAT CCC ACC AAG CCT GAC 
GGA GGT CTA GGG TGG TTC GGA CTG 
Pro Pro Asp Pro Thr Lys Pro Asp> 



980 990 io 

* * * 

TAT GAC ATC CGA GCT GAT GTG TGG 
ATA CTG TAG GCT CGA CTA CAC ACC 
Tyr Asp He Arg Ala Asp Val Trp 



,0 1010 1020 

* 

AGC CTG GGC ATC TCA CTG GTG GAG 
TCG GAC CCG TAG AGT GAC CAC CTC 
Ser Leu Gly He Ser Leu Val Glu> 



1030 1040 
* * * * 

CTG GGA ACA GGA CAG TTC 
GAC CGT TGT CCT GTC AAG 
Leu Ala Thr Gly Gla Phe 



1050 
* * * 

CCC TAT AAG AAC TGC 
GGG ATA TTC TTG ACG 
Pro Tyr Lys Asn Cys 



1060 1070 
* * * 

AAG ACG GAC TTT GAG 
TTC TGC CTG AAA CTC 
Lys Thr Asp Phe Glu> 



1080 1090 

GTC CTC ACC AAA GTC CTA CAG GAA 
CAG GAG TGG TTT CAG GAT GTC CTT 
Val Leu Thr Lys Val Leu Gin Glu 



1100 mo 

* * • * 

GAG CCC CCA CTC CTG CCT GGT CAC 
CTC GGG GGT GAG GAC GGA CCA GTG 
Glu Pro Pro Leu Leu Pro Gly His> 



-120 H30 H4o 

ATG GGC TTC TCA GGG GAC TTC CAG 
TAC CCG AAG AGT CCC CTG AAG GTC 
Met Gly Phe Ser Gly Asp Phe Gin 



1150 H60 

* 

TCA TTT GTC AAA GAC TGC CTT ACT 
AGT AAA CAG TTT CTG ACG GAA TGA 
Ser Phe Val Lvs Asp Cys Leu Thr> 



1170 H80 H90 

***** 

AAA GAT CAC AGG AAG AGA CCA AAG 
TTT CTA GTG TCC TTC TCT GGT TTC 
Lys Asp His Arg Lys Arg Pro Lys 



1200 1210 
* 

TAT AAT AAG CTA CTT GAA CAC AGC 
ATA TTA TTC GAT GAA CTT GTG TCG 
Tyr Asn Lys Leu Leu Glu His Ser> 



l22 °. . " 3 ? . "40 ^ 12S o 1260 

TTC ATC AAG CAC TAT GAG ATA CTC GAG GTG GAT GTC r~r Inn t~ 



1270 _ 1 28 0 _ u ,o 1300 U1( , 

* * 
JT GGA GTC CTC 
^A CCT CAG GAC 

FIG. 13C 



AAG GAT GTC ATG GCG AAG ACC GAG TCC CCA AGG ACT AGT GQ& GTC CTG 
TTC CTA CAG TAC CGC TTC TGG CTC AGG GGT TCC TGA TCA CCT CAG GAC 



SUBSTITUTE SHEET (RULE 26) 



WO 99/02547 



PCT/US98/14101 



45/54 

Mouse MKK7c 

Lys Asp Val Met Ala Lys Thr Glu Ser Pro Arg Thr Ser Gly Val Leu> 



"20 1330 1340 1350 



1350 



ACT CAG CAC CAT CTS CCC TTC TTC AGG 7A GCCTCATGGC AGCGGCCAGC 
TCA GTC GTG GTA GAC GGG AAG AAG TCC AT CGGAGTACCG TCGCCGGTCG 
Ser Gin His His Leu Pro Phe Phe Arg> (SEQ ID NO: 28) 



1370 1380 1390 1400 1410 



1420 



CCCGCAGGGG CCCCGGGCCA CGGCCACCGA CCCCCCCCCC AACCTGGCCA ACCCAGCTGC 
GGGCGTCCCC GGGGCCCGGT GCCGGTGGCT GGGGGGGGGG TTGGACCGGT TGGGTCGACG 



1430 1440 1450 1460 1470 



1430 



CCATCAGGGG ACCTGGGACC TGGACGACTG CCAAGGACTG AGGACAGAAA GTAGGGGGTT 
GGTAGTCCCC TGGACCCTGG ACCTGCTGAC GGTTCCTGAC TCCTGTCTTT CATCCCCCAA 



1«0 1500 1510 1520 1530 



1540 



CCCATCCAGC TCTGACTCCC TGCCTACCAG CTGTGGACAA AAGGGCATGC TGGTTCCTAA 
GGGTAGGTCG AGACTGAGGG ACGGATGGTC GACACCTGTT TTCCCGTACG ACCAAGGATT 

1550 1560 1570 1580 1590 1600 

TCCCTCCCAC TCTGGGGTCA GCCAGCAGTG TGAGCCCCAT CCCACCCCGA CAGACACTGT 
AGGGAGGGTG AGACCCCAGT CGGTCGTCAC ACTCGGGGTA GGGTGGGGCT CTCTGTGACA 

1610 1620 1630 1640 

* * »* »* * » 

GAACGGAAGA CAGCAGGCCA AAAAAAAAAA AAAAAAAAAA AAA (SEQ ID NO 271 
CTTGCCTTCT GTCGTCCGGT TTTTTTTTTT TTTTTTTTTT TTT 



FIG. 13D 



SUBSTITUTE SHEET (RULE 26) 
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MKK7d 46/54 
Sequence Range: 1 to 1578 

. " . 2 °. . 30 " 50 60 

SSSS? ^STTCACTAC 0X00^ GGCC^SACTG 

Gv^AGGACATC C^CTTTTAAG ACAAGTGATG GACCGGTGGA CCGGACTGAC 

. ™ . 8 ? 30 100 no 120 

JSSSSSS s= z~ ssss ssss ssks 

130 140 un 

* . # 15 ° # 160 ^ 170 180 

TTCTGGACAA AGTCTTCCAC GTTTCC^»"-« ^-^1*-^^^ * * * 

AAGACC^TT TCAGAAGG7G CAAAGGAAGG ™»§ U££?£g ISS£g 

. "2 . 2 °? . 21 ° 220 230 240 

SScISS? SSSS TGGG< ^ cgct^cca; cctcagagag 

GTTGAGGv^o «ccggttgct acccccgtcg gcgagtggta ggagtctctc 

. "2 . "° . "0 280 290 

SSSSS SSSSS SSSS S S S S £ 

Met Leu Gly Leu Pro> 



• 31 ? # 320 

Ser tS ££S JS S S TAC CTC TCG TAG CTC TAA CTG GTC 

— i-eu pne Thr Pro Arg Ser Met Glu Ser lie Glu He Asp Gln> 



350 360 

* 

AAG CTG CAG GAG ATC ATG AAG CAG 
TTC GAC GTC CTC TAG TAC TTC GTC 
Lys Leu Gin Glu He Met Lys Gin 



37 0 380 390 

ACA GGG TAC CTG ACT ATC GGG- GGC 
TGT CCC ATG GAC TGA TAG CCC CCG 
Thr Gly Tyr Leu Thr He Gly Gly> 



400 410 

CAG CGT TAT CAG GCA GAA ATC AAT 
GTC GCA ATA GTC CGT CTT TAG TTA 
Gin Arg Tyr Gin Ala Glu lie Asn 



420 430 

GAC TTG GAG AAC TTG GGT GAG ATG 
CTG AAC CTC TTG AAC CCA CTC TAC 
Asp Leu Glu Asn Leu Gly Glu Met> 



440 450 
* * * 

GGC AGT GGT ACC 
CCG TCA CCA TGG 
Gly Ser Gly Thr 



460 
* » 

TGT GGT CAG GTG 
ACA CCA GTC CAC 
Cys Gly Gin Val 



470 

* * * 

TGG AAG ATG CGG 
ACC TTC TAC GCC 
Trp Lys Met Arg 



480 

TTC CGG AAG ACA 
AAG GCC TTC TGT 
Phe Arg Lys Thr> 



490 500 S10 

* 

GGC CAC ATC ATT GCT GTT AAG CAA 
CCG GTG TAG TAA CGA CAA TTC GTT 

Gly His He He Ala Val Lys Gin 



520 

ATG CGG CGC TCT GGG AAC AAG GAA 
TAC GCC GCG AGA CCC TTG TTC CT~* 
Met Arg Arg Ser Gly Asn Lys Glu> 

FIG. 14A 



SUBSTITUTE SHEET (RULE 26) 



WO 99/02547 



PCT7US98/14101 



47/54 

MKK7d 

* *" * "5 . 5? ? . 580 

SS £5 TTC ^ GAC "° GTA G ^ A CTC CAT 

ri^ I3« r ?° G AAC TAC CTG GAC CTA ^ CAT GAG TTC TCG GTA 

Glu Asa Lys Arg He Leu Met Asp Leu Asp Val Val Leu Lys Ser Ks> 



590 600 

GAC TGC CCT TAC ATC GTT CAG TGC 
CTG ACG GGA ATG TAG CAA GTC ACG 
Asp Cys Pro Tyr lie Val Gin Cys 



610 620 630 

* * * * • 

TTT GGC ACC TTC ATC ACC AAC ACA 
AAA CCG TGG AAG TAG TGG TTG TGT 
Phe Gly Thr Phe He Thr Asn Thr> 



640 650 

* 

GAC GTC TTT ATT GCC ATG GAG CTC 
CTG CAG AAA TAA CGG TAC CTC GAG 
Asp Val Phe He Ala Met Glu Leu 



660 670 

* ♦ ♦ * 

ATG GGC ACA TGT GCA GAG AAG CTG 
TAC CCG TGT ACA CGT CTC TTC GAC 
Met Gly Thr Cys Ala Glu Lys Leu> 



680 690 700 

***** 

AAG AAA CGA ATG CAG GGC CCC ATT 
TTC TTT GCT TAC GTC CCG GGG TAA 
Lys Lys Arg Met Gin Gly Pro He 



710 720 
* * * * * 

CCA GAG CGA ATC CTG GGC AAG ATG 
GGT.CTC GCT TAG GAC CCG TTC TAC 
Pro Glu Arg He Leu Gly Lys Met> 



? 30 740 750 

ACT GTG GCG ATT GTG AAA GCA CTG 
TGA CAC CGC TAA CAC TTT CGT GAC 
Thr Val Ala He Val Lys Ala Leu 



760 770 
* * * * * 

TAC TAT CTG AAG GAG AAG CAT GGC 
ATG ATA GAC TTC CTC TTC GTA CCG 
Tyr Tyr Leu Lys Glu Lys His Gly> 



^IH £TS CGC GAT GTC AAA CCC TCC AAC ATC CTG CTA GAT GAG CGG 

SS 2! Si Sc S S? T f° AGG m TAG GAC S? So? 

A 118 Hls Arg Val Pro Ser Asn lie Leu Leu Asp Glu Arg> 



830 840 850 

* * » ■ 



860 870 



cS S? T*Ig £2 Sg IS 25 T 7 ATC AGT GGC CGC CTT GTT GAC 

GlZ Gin r A f f* 0 ^ CTG *** CCG TAG TCA CCG CCG GAA CAA CTG 

Gly Gin lie Lys Leu Cys Asp Phe Gly lie Ser Gly Arg Leu Val Asp> 



880 890 



900 910 



AGG TTT CGG TTT TGT GCC Tel ^ TGT GCT ^ C ™ A * G GCT CCC 

Ser Ss IS 131 S « * CCG ACA CGA CGS ATA TAC GGA GGG 
ser Lys Ala Lys Thr Arg Ser Ala Gly Cys Ala Ala Tyr Met Ala Pro 



920 930 940 950 



* 



960 



FIG. 14B 
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MKK7d 48/54 

Glu Arg He Asp Pro Pro Asp Pro Thr Lys Pro Asp Tyr Asp He Ar*> 
97 ? # 98 ? . "0 ^ 1000 xoio 

S SI S? IS 22 SS TCA " G GT ^ GAG " G ^ ACA ad 

It- ll- ,7T m I" ° CCG TAG AGT «C CAC CTC GAC CGT TGT CCT 
Ala Asp Val Trp Ser Leu Gly He Ser Leu Val Glu Leu Ala tS GlJ> 



102 ? . 103 ? # 1040 1Q50 1060 

«. Ph. Tyr Lys A .„ cy = 



. 107 2 . 108 ? # "90 HOO mo 

35 Si SS ~ S S 25 CT * CCT °™ ^ 

t£? t ^7 5, 5; w GxjG GGT GAG GAC GGA CCA GTG TAC CCG AAG AGT 

Val Leu Gin Glu Glu Pro Pro Leu Leu Pro Gly His Met Gly JJe s2> 



1120 H30 

GGG GAC TTC CAG TCA TTT GTC AAA 
CCC CTG AAG GTC AGT AAA CAG TTT 
Gly Asp Phe Gla Ser Phe Val Lys 



1140 H50 

* * * * 

GAC- TGC CTT ACT AAA GAT CAC AGG 
CTG ACG GAA TGA TTT CTA GTG TCC 
Asp Cys Leu Thr Lys Asp His Arg> 



1160 H70 iiso 

* * ♦ * « 

AAG AGA CCA AAG TAT AAT AAG CTA 
TTC TCT GGT TTC ATA TTA TTC GAT 
Lys Arg Pro Lys Tyr Asn Lys Leu 



1190 1200 
* * * # ♦ 

CTT GAA CAC AGC TTC ATC ATC AAG 
GAA CTT GTG TCG AAG TAG TAG TTC 
Leu Glu His Ser Phe He He Lys> 



1210 1220 1230 

CAC TAT GAG ATA CTC GAG GTG GAT 
GTG ATA CTC TAT GAG CTC CAC CTA 
His Tyr Glu He Leu Glu Val Asp 



1240 1250 
* * * * * 

GTC GCG TCC TGG TTT AAG GAT GTC 
CAG CGC AGG ACC AAA TTC CTA CAG 
Val Ala Ser Trp Phe Lys Asp Val> 



" 6 2 . " 7 ° . "80 # 1290 1300 

2 2 5 S S 2 S 5 2 2 g 2 £ c 2 EL 



* 131 ° . 132 ° . "30 1340 1350 

S525=F ss =ss2ssss 

His Leu Pro Phe Phe Ser Gly Ser Leu Glu Glu Ser Pro Sr Ser Pro> 

. . 137 ° . "80 ^ 1390 

CCT TCT CCC AAG TCC TTC CCT CTG TCA CCA GCC ATC CCT CAG GCC CAG 

FIG. 14C 
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MKK7d 49/54 

GGA AGA GGG TTC AGO AAG GGA GAG AGT GGT CGG TAG GGA GTC CGG GTC 
Pro Ser Pro Lys Ser Phe Pro Leu Ser Pro Ala He Pro Gin Ala Gln> 

14 °? . 141 ° . 1*20 ^ 1430 ^ 1440 1450 

^ ?^ TGG GTC TCG GGC AGG TAGGGACCTG GAGTGGCCTG GTCCCACCCT 
COT CTC ACC CM AGC CCG TCC ATCCCTGGAC CTCACCGGAC CAGGGTGgS 
Ala Glu Trp Val Ser Gly Arg> (SEQ ID NO: 30) 

* 146 ° . 147 ° "80 1490 1500 i 510 

* * * * * * * * 

CTGACCTCCT CCTCAGGCCA CCAGTGTTGC CCTCTTCCCT TTTTAAAAra niw^ 
GACTGGAGGA GGAGTCCGGT GGTCACAACG GGAgIISS AAaIttT^S TT^ItSgaI 

. 152 ° . 152 ° . "50 1560 1570 

GTTTGTAAAT CCTTAGACGC TTGAGAATAA lArpr'^rrr n pmrw^ 

CAAACATTTA GGAATCTT^r a*^^™. AACCCTTCCC TTTTCTTCCG AAAAAAAAAA 
UAAACATTTA GGAATCTGCG AACTCTTATT TTGGGAAGGG AAAAGAAGGC rTTTTTTTTT 

* 

AAAAAAAA (SEQ ID NO: 29) 
rTTTTTTT 



FIG. 14D 
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MKK7e 

S equence R ange: 1 to 1598 

10 20 

* * * * 

AGCGCAGGCG CAGTGCGGTG 
TCGCGTCCGC GTCACGCCAC 

70 80 

* * * * 

GCGGCAGCGG CGGCGGGGAA 
CGCCGTCGCC GCCGCCCCTT 



50/54 



30 40 
* * * * 

TTTGTCTACC CCGGACTGAC 
AAACAGATGG GGCC7GACTG 

90 

* * * 

G ATG GCG GCG TCC TCC 
C TAG CGC CGC AGG AGG 
Met Ala Ala Ser Ser 



50 60 

* * * # 

GGGTGGCCTG GCGGTGAGCG 
CCCACCGGAC CGCCACTCGC 

100 no 

* * * 

CTG GAG CAG AAG CTG 
GAC CTC GTC TTC GAC 
Leu Glu Gin Lys Leu> 



120 130 
* * * * * 

TCC CGC CTG GAA GCC AAG CTG AAG 
AGG GCG GAC CTT CGG TTC GAC TTC 
Ser Arg Leu Glu Ala Lys Leu Lys 



140 150 

* * * * 

CAG GAG AAC CGT GAG GCC CGC AGG 
GTC CTC TTG GCA CTC CGG GCG TCC 
Gin Glu Asn Arg Glu Ala Arg Arg> 



160 170 
* * * 

AGG ATC GAC CTC 
TCC TAG CTG GAG 
Arg lie Asp Leu 



180 

AAC TTG GAT ATC 
TTG AAC CTA TAG 
Asn Leu Asp lie 



190 

* * 

AGC CCA CAG CGG 
TCG GGT GTC GCC 
Ser Pro Gin Arg 



200 

* * 

CCC AGG CCC ACC 
GGG TCC GGG TGG 
Pro Arg Pro Thr> 



210 220 230 

* * * * * 

CTG CAA CTC CCA CTG GCC AAC GAT 
GAC GTT GAG GGT GAC CGG TTG CTA 
Leu Gin Leu Pro Leu Ala Asn Asp 



240 250 
* * * * * 

GGG GGC AGC CGC TCA CCA TCC TCA 
CCC CCG TCG GCG AGT GGT AGG AGT 
Gly Gly Ser Arg Ser Pro Ser Ser> 



2eo 270 280 290 300 

* * * ****** 

GAG AGC TCC CCA CAG CAC CCT ACA CCC CCC ACC CGG CCC CGC CAC ATG 
CTC TCG AGG GGT GTC GTG GGA TGT GGG GGG TGG GCC GGG GCG GTG TAG 
Glu Ser Ser Pro Gin His Pro Thr Pro Pro Thr Arg Pro Arg His Met> 



310 320 
* * * * * 

CTG GGG CTC CCA TCA ACC TTG TTC 
GAC CCC GAG GGT AGT TGG AAC AAG 
Leu Gly Leu Pro Ser Thr Leu Phe 



360 370 

* * * * * 

GAG ATT GAC CAG AAG CTG CAG GAG 

CTC TAA CTG GTC TTC GAC GTC CTC 

Glu lie Asp Gin Lys Leu Gin Glu 



330 340 350 

***** 

ACA CCG CGC AGT ATG GAG AGC ATC 
TGT GGC GCG TCA TAC CTC TCG TAG 
Thr Pro Arg Ser Met Glu Ser Ile> 



380 390 

* * * * 

ATC ATG AAG CAG ACA GGG TAC CTG 
TAG TAC TTC GTC TGT CCC ATG GAC 
He Met Lys Gin Thr Gly Tyr Leu> 



400 410 420 

* * * * • 

ACT ATC GGG GGC CAG CGT TAT CAG 
TGA TAG CCC CCG GTC GCA ATA GTC 
Thr He Gly Gly Gin Arg Tyr Gin 



430 440 
***** 

GCA GAA ATC AAT GAC TTG GAG AAC 
CGT CTT TAG TTA CTG AAC CTC TTG 
Ala Glu lie Asn Asp Leu Glu Asn> 

FIG. 15A 
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"I . "! . ™ 480 „0 

L.U Gly G1U 2S g 5 g « - « - « JCC « gc OC^ 



500 510 
* * • 

TTC CGG AAG ACA GGC 
AAG GCC TTC TGT CCG 
Phe Arg Lys Thr Gly 



520 

CAC ATC ATT GCT GTT AAG 
GTG TAG TAA CGA CAA TTC 
His lie lie Ala Val Lys 



530 540 
* * * 

CAA ATG CGG CGC TCT 
GTT TAC GCC GCG AGA 
Gin Met Arg Arg Ser> 



550 560 

GGG AAC AAG GAA GAG AAT AAG CGC 
CCC TTG TTC CTT CTC TTA TTC GCG 
Gly Asn Lys Glu Glu Asn Lys Arg 



570 580 590 

* * * * * 

ATT TTG ATG GAC CTG GAT GTA GTA 
TAA AAC TAC CTG GAC CTA CAT CAT 
He Leu Met Asp Leu Asp Val Val> 



600 6X0 

# 

CTC AAG AGC CAT GAC TGC CCT TAC 
GAG TTC TCG GTA CTG ACG GGA ATG 
Leu Lys Ser His Asp Cys Pro Tyr 



620 630 

♦ * * ♦ 

ATC GTT CAG TGC TTT GGC ACC TTC 
TAG CAA GTC ACG AAA CCG TGG AAG 
He- Val Gin Cys Phe Gly Thr Phe> 



A!. ol » w . ^ , y . ™ 5 SfSSSSSiSSSS 2L 



"! . 75 : . «; "o „, 

s s ^ s s s s s s si 001 °™ tac ; " « "° 

i-u sly Ly, „« i£ £f S £i Si? s c J,™ " A ^ m 

xxe vai Lys Ala Leu Tyr Tyr Leu Lys> 



790 800 

* 

GAG AAG CAT GGC GTC ATC CAT CGC 
CTC TTC GTA CCG CAG TAG GTA GCG 
Glu Lys His Gly Val He His Arg 



"0 820 830 

* * * * * 

GAT GTC AAA CCC TCC AAC ATC CTG 
CTA CAG TTT GGG AGG TTG TAG GAC 
Asp Val Lys Pro Ser Asn He Leu> 



840 850 

CTA GAT GAG CGG GGC CAG ATC AAG 
GAT CTA CTC GCC CCG GTC TAG TTC 
Leu Asp Glu Arg Gly Gin He Lys 



860 870 

* * * * 

CTC TGT GAC TTT GGC ATC AGT GGC 
GAG ACA CTG AAA CCG TAG TCA CCG 
Leu Cys Asp Phe Gly H e Ser Gly> 

FIG. 15B 
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880 890 900 910 920 

********** 

CGC CTT GTT GAC TCC AAA GCC AAA ACA CGG AGT GCT GGC TGT GCT GCC 
GCG GAA CAA CTG AGG TTT CGG TTT TGT GCC TCA CGA CCG ACA CGA CGG 
Arg Leu Val Asp Ser Lys Ala Lys Thr Arg Ser Ala Gly Cys Ala Ala> 



930 940 9so 

* * * * * 

TAT ATG GCT CCC GAG CGC ATC GAC 
ATA TAC CGA GGG CTC GCG TAG CTG 
Tyr Met Ala Pro Glu Arg He Asp 



960 970 
* * * * * 

CCT CCA GAT CCC ACC AAG CCT GAC 
GGA GGT CTA GGG TGG TTC GGA CTG 
Pro Pro Asp Pro Thr Lys Pro Asp> 



980 990 iQ, 

* * * * 

TAT GAC ATC CGA GCT GAT GTG TGG 
ATA CTG TAG GCT CGA CTA CAC ACC 
Tyr Asp He Arg Ala Asp Val Trp 



)0 1010 1020 

* * * * * 

AGC CTG GGC ATC TCA CTG GTG GAG 
TCG GAC CCG TAG AGT GAC CAC CTC 
Ser Leu Gly He Ser Leu Val Glu> 



1030 1040 1050 1060 1070 

* * * * ****** 

CTG GCA ACA GGA CAG TTC CCC TAT AAG AAC TGC AAG ACG GAC TTT GAG 
GAC CGT TGT CCT GTC AAG GGG ATA TTC TTG ACG TTC TGC CTG AAA CTC 
Leu Ala Thr Gly Gin Phe Pro Tyr Lys Asn Cys Lys Thr Asp Phe Glu> 



1080 1090 
* * • * * 

GTC CTC ACC AAA GTC CTA CAG GAA 
CAG GAG TGG TTT CAG GAT GTC CTT 
Val Leu Thr Lys Val Leu Gin Glu 



1100 1110 

* * * * 

GAG CCC CCA CTC CTG CCT GGT CAC 
CTC GGG GGT GAG GAC GGA CCA GTG 
Glu Pro Pro Leu Leu Pro Gly His> 



1120 1130 1140 

***** 

ATG GGC TTC TCA GGG GAC TTC CAG 
TAC CCG AAG AGT CCC CTG AAG GTC 
Met Gly Phe Ser Gly Asp Phe Gla 



1150 1160 
***** 

TCA TTT GTC AAA GAC TGC CTT ACT 
AGT AAA CAG TTT CTG ACG GAA TGA 
Ser Phe Val Lys Asp Cys Leu Thr> 



1170 1180 H90 

***** 

AAA GAT CAC AGG AAG AGA CCA AAG 
TTT CTA GTG TCC TTC TCT GGT TTC 
Lys Asp His Arg Lys Arg Pro Lys 



1200 1210 
* * * * * 

TAT AAT AAG CTA CTT GAA CAC AGC 
ATA TTA TTC GAT GAA CTT GTG TCG 
Tyr Asn Lys Leu Leu Glu His Ser> 



1220 1230 12. 

* * * * 

TTC ATC ATC AAG CAC TAT GAG ATA 
AAG TAG TAG TTC GTG ATA CTC TAT 
Phe He He Lys His Tyr Glu He 



10 1250 1260 

* * * * * 

CTC GAG GTG GAT GTC GCG TCC TGG 
GAG CTC CAC CTA CAG CGC AGG ACC 
Leu Glu Val Asp Val Ala Ser Trp> 



1270 ^ 1280 1290 1300 1310 

* 

3A GTC 
7T CAC 

FIG. 15C 



TTT AAG GAT GTC ATG GCG AAG ACC GAG TCC CCA AGG ACT AGT GGA GTC 
AAA TTC CTA CAG TAC CGC TTC TGG CTC AGG GGT TCC TGA TCA CCT CAG 
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Phe Lys Asp Val Met Ala Lys Thr Glu Ser Pro Arg Thr Ser Giy Val> 



1320 1330 1340 1350 

*** ****** 

CTG AGT CAG CAC CAT CTG CCC TTC TTC AGT GGG AGT CTG GAG GAG TCT 
GAC TCA GTC GTG GTA GAC GGG AAG AAG TCA CCC TCA GAC CTC CTC AGA 
Leu Ser Gin His His Leu Pro Phe Phe Ser Gly Ser Leu Glu Glu Ser> 



1360 1370 1380 

* * * * * 

CCC ACT TCC CCA CCT TCT CCC AAG 
GGG TGA AGG GGT GGA AGA GGG TTC 
Pro Thr Ser Pro Pro Ser Pro Lys 



1390 1400 
* * * * • 

TCC TTC CCT CTG TCA CCA GCC ATC 
AGG AAG GGA GAC AGT GGT CGG TAG 
Ser Phe Pro Leu Ser Pro Ala Ile> 



1410 1420 1430 

* * * * + 

CCT CAG GCC CAG GCA GAG TGG GTC 
GGA GTC CGG GTC CGT CTC ACC CAG 
Pro Gin Ala Gin Ala Glu Trp Val 



1440 1450 1460 

* * * * * # 

TCG GGC AGG TAGGGACCTG GAGTGGCCTG 
AGC CCG TCC ATCCCTGGAC CTCACCGGAC 
Ser Gly Arg> (SEQ ID NO: 32) 



1470 1480 1490 1500 1510 1520 

* * * * * * * * * * # * 

GTCCCACCCT CTGACCTCCT CCTCAGGCCA CCAGTGTTGC CCTCTTCCCT TTTTAAAACA 
CAGGGTGGGA GACTGGAGGA GGAGTCCGGT GGTCACAACG GGAGAAGGGA AAAATTTTGT 

1530 1540 1550 1560 1570 1580 

* * * * * * ** * * * # 

AAA TACCCTT GTTTGTAAAT CCTTAGACGC TTGAGAATAA AACC C TTCCC TTTTCTTCC G 
TTTATGGGAA CAAACATTTA GGAATCTGCG AACTCTTATT TTGGGAAGGG AAAAGAAGGC 

1590 

* * * 

AAAAAAAAAA AAAAAAAA (SEQ ID NO: 31) 
TTTTTTTTTT TTTTTTTT 

FIG. 15D 



SUBSTITUTE SHEET (RULE 26) 



WO 99/02547 



PCT/US98/141G1 



54/54 



Relative 
Luciferase 2 
Activity 

1 



FIG. 16A 




MKK4 
MKK7 
JNK1 

JNK1 (APF) 



200-. 

Relative 150- 
Luciferase 
Activity 10Q 

50 - 




FIG. 16B 



I l T I I I I T I 

- + ++-- - + ++--MKK4 

- - + ++ + + + MKK7 

+-+-+- + - + - + - JNK1 

+- + -+ +- + -+ JNK1 (APF) 



ATF-2 



mATF-2 



SUBSTITUTE SHEET (RULE 26) 



WO 99/02547 



PCT/US98/14101 



1 



SEQUENCE LISTING 

(1) GENERAL INFORMATION 
(i) APPLICANT: University of Massachusetts 

(ii) TITLE OF THE INVENTION: CYTOKINE-, STRESS-, AND ONCOPROTEIN- 

ACTIVATED HUMAN PROTEIN KINASE KINASES 

(iii) NUMBER OF SEQUENCES: 34 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Richardson P.C. 

(B) STREET: 225 Franklin Street 

(C) CITY: Boston 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02110-2804 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM : Windows 9 5 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US98/14101 

(B) FILING DATE: 07-JUL-1998 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/888,429 

(B) FILING DATE: 07-JUL-1997 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Fasse, Peter J. 

(B) REGISTRATION NUMBER: 32,983 

(C) REFERENCE/DOCKET NUMBER: 07917/053WO1 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617/542-5070 

(B) TELEFAX: 617/542-8906 

(C) TELEX: 299354 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 03 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/ KEY: Coding Sequence 

(B) LOCATION: 338... 1291 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

TGGCTGGCAA TGGCCTTGCT GACCTCGAGC CGGGCCCACG TGGGGACCTT TGGAGCACAG 
CCTACGATCC TGGTGCAAGG CCGGTGGATG CAGAGGCCAG TCCATATACC ACCCAGGCCT 
GCGAGGAGCG TGGTCCCCAC CCATCCAGCC CATATGTGCA AGTGCCCTTG ACAGAGAGGC 
TGGTCATATC CATGGTGACC ATTTATGGGC CACAACAGGT CCCCATCTGC GCAGTGAACC 
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CTGTGCTGAG CACCTTGCAG ACGTGATCTT GCTTCGTCCT GCAGCACTGT GCGGGGCAGG 3 00 

AAAATCCAAG AGGAAGAAGG ATCTACGGAT ATCCTGC ATG TCC AAG CCA CCC GCA 3 55 

Met Ser Lys Pro Pro Ala 
1 5 

5 CCC AAC CCC ACA CCC CCC CGG AAC CTG GAC TCC CGG ACC TTC ATC ACC 4 03 

Pro Asn Pro Thr Pro Pro Arg Asn Leu Asp Ser Arg Thr Phe lie Thr 
10 15 20 

ATT GGA GAC AGA AAC TTT GAG GTG GAG GCT GAT GAC TTG GTG ACC ATC 451 
10 lie Gly Asp Arg Asn Phe Glu Val Glu Ala Asp Asp Leu Val Thr lie 

25 30 35 

TCA GAA CTG GGC CGT GGA GCC TAT GGG GTG GTA GAG AAG GTG CGG CAC 4 99 

Ser Glu Leu Gly Arg Gly Ala Tyr Gly Val Val Glu Lys Val Arg His 
40 45 50 

15 GCC CAG AGC GGC ACC ATC ATG GCC GTG AAG CGG ATC CGG GCC ACC GTG 54 7 

Ala Gin Ser Gly Thr He Met Ala Val Lys Arg He Arg Ala Thr Val 
55 60 65 70 

AAC TCA CAG GAG CAG AAG CGG CTG CTC ATG GAC CTG GAC ATC AAC ATG 5 95 

Asn Ser Gin Glu Gin Lys Arg Leu Leu Met Asp Leu Asp He Asn Met 
20 75 80 ** 85 

CGC ACG GTC GAC TGT TTC TAC ACT GTC ACC TTC TAC GGG GCA CTA TTC 643 
Arg Thr Val Asp Cys Phe Tyr Thr Val Thr Phe Tyr Gly Ala Leu Phe 
90 95 100 

25 AGA GAG GGA GAC GTG TGG ATC TGC ATG GAG CTC ATG GAC ACA TCC TTG 691 

Arg Glu Gly Asp Val Trp He Cys Met Glu Leu Met Asp Thr Ser Leu 
105 110 115 

GAC AAG TTC TAC CGG AAG GTG CTG GAT AAA AAC ATG ACA ATT CCA GAG 73 9 

Asp Lys Phe Tyr Arg Lys Val Leu Asp Lys Asn Met Thr He Pro Glu 
30 120 125 130 

GAC ATC CTT GGG GAG ATT GCT GTG TCT ATC GTG CGG GCC CTG GAG CAT 787 
Asp He Leu Gly Glu He Ala Val Ser He Val Arg Ala Leu Glu His 
13 5 14 0 145 150 

CTG CAC AGC AAG CTG TCG GTG ATC CAC AGA GAT GTG AAG CCC TCC AAT 835 
35 Leu His Ser Lys Leu Ser Val He His Arg Asp Val Lys Pro Ser Asn 

155 160 165 

GTC CTT ATC AAC AAG GAG GGC CAT GTG AAG ATG TGT GAC TTT GGC ATC 8 83 

Val Leu He Asn Lys Glu Gly His Val Lys Met Cys Asp Phe Gly He 
170 175 180 

4 0 AGT GGC TAC TTG GTG GAC TCT GTG GCC AAG ACG ATG GAT GCC GGC TGC 931 

Ser Gly Tyr Leu Val Asp Ser Val Ala Lys Thr Met Asp Ala Gly Cys 
185 190 195 

AAG CCC TAC ATG GCC CCT GAG AGG ATC AAC CCA GAG CTG AAC CAG AAG 97 9 

Lys Pro Tyr Met Ala Pro Glu Arg He Asn Pro Glu Leu Asn Gin Lys 
45 200 205 210 

GGC TAC AAT GTC AAG TCC GAC GTC TGG AGC CTG GGC ATC ACC ATG ATT 1027 
Gly Tyr Asn Val Lys Ser Asp Val Trp Ser Leu Gly He Thr Met He 
215 220 225 " 230 



50 



GAG ATG GCC ATC CTG CGG TTC CCT TAC GAG TCC TGG GGG ACC CCG TTC 1075 
Glu Met Ala He Leu Arg Phe Pro Tyr Glu Ser Trp Gly Thr Pro Phe 
235 240 ~ 245 
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CAG CAG CTG AAG CAG GTG GTG GAG GAG CCG TCC CCC CAG CTC CCA GCC 1123 
Gin Gin Leu Lys Gin Val Val Glu Glu Pro Ser Pro Gin Leu Pro Ala 
250 255 260 

GAC CGT TTC TCC CCC GAG TTT GTG GAC TTC ACT GCT CAG TGC CTG AGG 1171 
5 Asp Arg Phe Ser Pro Glu Phe Val Asp Phe Thr Ala Gin Cys Leu Arg 
265 270 275 

AAG AAC CCC GCA GAG CGT ATG AGC TAC CTG GAG CTG ATG GAG CAC CCC 1219 
Lys Asn Pro Ala Glu Arg Met Ser Tyr Leu Glu Leu Met Glu His Pro 
280 285 290 

10 TTC TTC ACC TTG CAC AAA ACC AAG AAG ACG GAC ATT GCT GCC TTC GTG 12 67 
Phe Phe Thr Leu His Lys Thr Lys Lys Thr Asp He Ala Ala Phe Val 
295 300 305 310 

AAG AAG ATC CTG GGA GAA GAC TCA TAGGGGCTGG GCCTCGGACC CCACTCCGGC 1321 
Lys Lys He Leu Gly Glu Asp Ser 
15 315 

CCTCCAGAGC CCCACAGCCC CATCTGCGGG GGCAGTGCTC ACCCACACCA TAAGCTACTG 13 81 

CCATCCTGGC CCAGGGCATC TGGGAGGAAC CGAGGGGGCT GCTCCCACCT GGCTCTGTGG 1441 

CGAGCCATTT GTCCCAAGTG CCAAAGAAGC AG AC C ATTGG GGCTCCCAGC CAGGCCCTTG 15 01 

TCGGCCCCAC CAGTGCCTCT CCCTGCTGCT CCTAGGACCC GTCTCCAGCT GCTGAGATCC 15 61 

2 0 TGGACTGAGG GGGCCTGGAT GCCCCCTGTG GATGCTGCTG CCCCTGCACA GCAGGCTGCC 1621 

AGTGCCTGGG TGGATGGGCC ACCGCCTTGC CCAGCCTGGA TGC CATC CAA GTTGTATATT 16 81 

TTTTTAATCT CTCGACTGAA TGGACTTTGC ACACTTTGGC CCAGGGTGGC CACACCTCTA 1741 

TCCCGGCTTT GGTGCGGGGT ACACAAGAGG GGATGAGTTG TGTGAATACC CCAAGACTCC 1801 

CATGAGGGAG ATGCCATGAG CCGCCCAAGG CCTTCCCCTG GCACTGGCAA ACAGGGCCTC 1861 

2 5 TGCGGAGCAC ACTGGCTCAC CCAGTCCTGC CCGCCACCGT TATCGGTGTC ATTCACCTTT 1921 

CGTGTTTTTT TTAATTTATC CTCTGTTGAT TTTTTCTTTT GCTTTATGGG TTTGGCTTGT 1981 

TTTTCTTGCA TGGTTTGGAG CTGATCGCTT CTCCCCCACC CCCTAGGGG 2 03 0 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 318 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

3 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



40 



Met 


Ser 


Lys 


Pro 


Pro 


Ala 


Pro 


Asn 


Pro 


Thr 


Pro 


Pro 


Arg 


Asn 


Leu Asp 


1 








5 










10 








15 


Ser 


Arg 


Thr 


Phe 


He 


Thr 


He 


Gly 


Asp 


Arg 


Asn 


Phe 


Glu 


Val 


Glu Ala 








20 










25 










30 




Asp 


Asp 


Leu 


Val 


Thr 


He 


Ser 


Glu 


Leu 


Gly 


Arg 


Gly 


Ala 


Tyr 


Gly Val 






35 










40 










45 


Val 


Glu 


Lys 


Val 


Arg 


His 


Ala 


Gin 


Ser 


Gly 


Thr 


He 


Met 


Ala 


Val Lys 




50 










55 










60 






Arg 


He 


Arg 


Ala 


Thr 


Val 


Asn 


Ser 


Gin 


Glu 


Gin 


Lys 


Arg 


Leu 


Leu Met 


65 










70 










75 




80 


Asp 


Leu 


Asp 


He 


Asn 


Met 


Arg 


Thr 


Val 


Asp 


Cys 


Phe 


Tyr 


Thr 


Val Thr 










85 










90 










95 


Phe 


Tyr 


Gly 


Ala 


Leu 


Phe 


Arg 


Glu 


Gly 


Asp 


Val 


Trp 


He 


Cys 


Met Glu 








100 










105 










110 




Leu 


Met 


Asp 


Thr 


Ser 


Leu 


Asp 


Lys 


Phe 


Tyr 


Arg 


Lys 


Val 


Leu 


Asp Lys 






115 










120 










125 




Asn 


Met 


Thr 


He 


Pro 


Glu 


Asp 


He 


Leu 


Gly 


Glu 


He 


Ala 


Val 


Ser He 




130 










135 








140 
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Val 


Arg Ala 


Leu 


Glu 


His 


Leu 


11. X O 


Coy 

OCX. 


Lys 


Leu 


Ser 


vai 


lie 


TT -! „ 

HIS 


Arg 


145 










150 










155 










loU 


Asp 

XT 


Val 




Pro 


Ser 


Asn 


Val 


Leu 


He 


Asn 


Lys 


Glu 






V ci J_ 


Lys 










165 










170 










1 1 d 

JL / D 


Met 


Cys 


Asp 


Phe 


Glv 


He 


Ser 


Glv 

vjxy 


Tvr 
xy x 


Leu 


Val 


Asp 


OCX 


v ell 


M.X ci 


Lys 








180 










185 








X Z> u 




Thr 


Met 


Asp 


Ala 


Glv 


Cys 


Lys 


Pro 


Tvr 
xy x 


Met 


Ala 


XT X U 




Arg 


lie 


Asn 






i or 

_L _7 _) 










200 










2 05 






Pro 


Glu 


Leu 


Asn 


Gin 


Lys 


Gly 


Tvt 
xy x 


Asn 


Val 


xjy o 


OCX 


Asp 


V cix 


Trp 


Ser 




210 










215 










220 






Leu 


Gly 


He 


Thr 


Met 


He 


Glu 


Met 


Ala 


He 


Leu 




xrlic 




Tyr 


vjIU 


225 










230 










235 






ZftU 


Ser 


Trp 


Gly 


Thr 


Pro 


Phe 


Gin 


Gin 


Leu 


Lys 


Gin 


Val 


Val 






rXO 










245 










250 










255 




Ser 


Pro 


Gin 


Leu 


Pro 


Ala 


Asp 


Arg 


Phe 


Ser 


Pro 


Glu 


Phe 


Val 


Asp 


Phe 








260 










265 










270 




Thr 


Ala 


Gin 


Cys 


Leu 


Arg 


Lys 


Asn 


Pro 


Ala 


Glu 


Arg 


Met 


Ser 


Tyr 


Leu 






275 










280 








285 






Glu 


Leu 


Met 


Glu 


His 


Pro 


Phe 


Phe 


Thr 


Leu 


His 


Lys 


Thr 


Lys 


Lys 


Thr 




290 










295 










300 






Asp 


He 


Ala 


Ala 


Phe 


Val 


Lys 


Lys 


He 


Leu 


Gly 


Glu 


Asp 


Ser 






305 










310 










315 











(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 1602 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

3 0 (ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 244 ... 1245 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

TAGCTGCAGC ACAGCCTTCC CTAACGTTGC AACTGGGGGA AAAATCACTT TCCAGTCTGT 6 0 

3 5 TTTGCAAGGT GTGCATTTCC ATCTTGATT C CCTGAAAGTC CATCTGCTGC ATCGGTCAAG 12 0 

AGAAACTCCA CTTGCATGAA GATTGCACGC CTGCAGCTTG CATCTTTGTT GCAAAACTAG 18 0 

CTACAGAAGA GAAGCAAGGC AAAGTCTTTT GTGCTCCCCT CCCCCATCAA AGGAAAGGGG 24 0 

AAA ATG TCT CAG TCG AAA GGC AAG AAG CGA AAC CCT GGC CTT AAA ATT 2 88 
Met Ser Gin Ser Lys Gly Lys Lys Arg Asn Pro Gly Leu Lys He 
40 1 5 10 * ^ 15 

CCA AAA GAA GCA TTT GAA CAA CCT CAG ACC AGT TCC ACA CCA CCT AGA 33 6 

Pro Lys Glu Ala Phe Glu Gin Pro Gin Thr Ser Ser Thr Pro Pro Arg 
20 25 30 

GAT TTA GAC TCC AAG GCT TGC ATT TCT ATT GGA AAT CAG AAC TTT GAG 3 84 

45 Asp Leu Asp Ser Lys Ala Cys He Ser He Gly Asn Gin Asn Phe Glu 

35 40 45 

GTG AAG GCA GAT GAC CTG GAG CCT ATA ATG GAA CTG GGA CGA GGT GCG 43 2 

Val Lys Ala Asp Asp Leu Glu Pro He Met Glu Leu Gly Arg Gly Ala 
50 55 60 

50 TAC GGG GTG GTG GAG AAG ATG CGG CAC GTG CCC AGC GGG CAG ATC ATG 4 80 

Tyr Gly Val Val Glu Lys Met Arg His Val Pro Ser Gly Gin He Met 
65 70 75 
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GCA GTG AAG CGG ATC CGA GCC ACA GTA AAT AGC CAG GAA CAG AAA CGG 52 8 

Ala Val Lys Arg lie Arg Ala Thr Val Asn Ser Gin Glu Gin Lys Arg 
80 85 90 95 

CTA CTG ATG GAT TTG GAT ATT TCC ATG AGG ACG GTG GAC TGT CCA TTC 576 
5 Leu Leu Met Asp Leu Asp lie Ser Met Arg Thr Val Asp Cys Pro Phe 

100 105 110 

ACT GTC ACC TTT TAT GGC GCA CTG TTT CGG GAG GGT GAT GTG TGG ATC 62 4 

Thr Val Thr Phe Tyr Gly Ala Leu Phe Arg Glu Gly Asp Val Trp lie 
115 120 125 

10 TGC ATG GAG CTC ATG GAT ACA TCA CTA GAT AAA TTC TAC AAA CAA GTT 672 

Cys Met Glu Leu Met Asp Thr Ser Leu Asp Lys Phe Tyr Lys Gin Val 
130 135 140 

ATT GAT AAA GGC CAG ACA ATT CCA GAG GAC ATC TTA GGG AAA ATA GCA 72 0 

lie Asp Lys Gly Gin Thr lie Pro Glu Asp lie Leu Gly Lys He Ala 
15 145 150 155 

GTT TCT ATT GTA AAA GCA TTA GAA CAT TTA CAT AGT AAG CTG TCT GTC 7 68 

Val Ser He Val Lys Ala Leu Glu His Leu His Ser Lys Leu Ser Val 
160 165 170 175 



20 



ATT CAC AGA GAC GTC AAG CCT TCT AAT GTA CTC ATC AAT GCT CTC GGT 816 
He His Arg Asp Val Lys Pro Ser Asn Val Leu He Asn Ala Leu Gly 
180 185 190 

CAA GTG AAG ATG TGC GAT TTT GGA ATC AGT GGC TAC TTG GTG GAC TCT 8 64 

Gin Val Lys Met Cys Asp Phe Gly He Ser Gly Tyr Leu Val Asp Ser 
195 20 0 " 2 05 

25 GTT GCT AAA ACA ATT GAT GCA GGT TGC AAA CCA TAC ATG GCC CCT GAA 912 

Val Ala Lys Thr He Asp Ala Gly Cys Lys Pro Tyr Met Ala Pro Glu 
210 215 220 

AGA ATA AAC CCA GAG CTC AAC CAG AAG GGA TAC AGT GTG AAG TCT GAC 9 60 

Arg He Asn Pro Glu Leu Asn Gin Lys Gly Tyr Ser Val Lys Ser Asp 
30 225 230 ~ 235 

ATT TGG AGT CTG GGC ATC ACG ATG ATT GAG TTG GCC ATC CTT CGA TTT 10 0 8 

He Trp Ser Leu Gly He Thr Met He Glu Leu Ala He Leu Arg Phe 
240 245 250 255 

CCC TAT GAT TCA TGG GGA ACT CCA TTT CAG CAG CTC AAA CAG GTG GTA 1056 
35 Pro Tyr Asp Ser Trp Gly Thr Pro Phe Gin Gin Leu Lys Gin Val Val 

260 265 "* 270 

GAG GAG CCA TCG CCA CAA CTC CCA GCA GAC AAG TTC TCT GCA GAG TTT 1104 
Glu Glu Pro Ser Pro Gin Leu Pro Ala Asp Lys Phe Ser Ala Glu Phe 
275 280 285 

40 GTT GAC TTT ACC TCA CAG TGC TTA AAG AAG AAT TCC AAA GAA CGG CCT 1152 

Val Asp Phe Thr Ser Gin Cys Leu Lys Lys Asn Ser Lys Glu Arg Pro 
290 295 300 

ACA TAC CCA GAG CTA ATG CAA CAT CCA TTT TTC ACC CTA CAT GAA TCC 12 0 0 

Thr Tyr Pro Glu Leu Met Gin His Pro Phe Phe Thr Leu His Glu Ser 
45 305 310 315 

AAA GGA ACA GAT GTG GCA TCT TTT GTA AAA CTG ATT CTT GGA GAC TAAAA 125 0 
Lys Gly Thr Asp Val Ala Ser Phe Val Lys Leu He Leu Gly Asp 
320 325 330 

50 AGCAGTGGAC TTAATCGGTT GACCCTACTG TGGATTGGTG GGTTTCGGGG TGAAGCAAGT 1310 

TC AC TAC AGC AT C AAT AGAA AGT CATCTTT GAGATAATTT AACCCTGCCT CTCAGAGGGT 137 0 
TTTCTCTCCC AATTTTCTTT TTACTCCCCC TCTTAAGGGG GCCTTGGAAT C TAT AGT AT A 143 0 
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GAATGAACTG TCTAGATGGA TGAATTATGA 
ATATTTAATG ATGTGTCATA TGAGTCCTCA 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 



6 



TAAAGGCTTA GGACTTCAAA AGGTGATTAA 14 9 0 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1550 
AAAAAAAAAA AAAAAAAAAA AA 1602 



(2) INFORMATION FOR SEQ ID NO : 4 : 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 34 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

10 (v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 





Met 


Ser 


Gin 


Ser 


Lys 


Gly 


Lys 


Lys 


Arq 


Asn 


Pro 


Glv 


Leu 


Lys 


He 


Pro 

£T J- \J 




1 








5 










10 








J- 3 




15 


Lys 


Glu 


Ala 


Phe 


Glu 


Gin 


Pro 


Gin 


Thr 


Ser 


Ser 


Thr 


Pro 


Pro 




7\ c*TT\ 
"P 








20 










25 














Leu 


Asp 


Ser 


Lys 


Ala 


Cys 


He 


Ser 


He 


Gly Asn 


Gin 


Asn 


Phe 


ril ii 

ul Li 


V d J_ 








35 










40 










4 5 










Lys 


Ala 


Asp 


Asp 


Leu 


Glu 


Pro 


He 


Met 


Glu 


Leu 


Gly 


Arg 


Gly 


Ala 


Tyr 






50 










55 










60 






20 


Gly Val 


Val 


Glu 


Lys 


Met 


Arg 


His 


Val 


Pro 


Ser 


Glv 


Gin 


He 


Met 


Ala 




65 










70 










75 








P A 
O U 




Val 


Lys 


Arg 


He 


Arg 


Ala 


Thr 


Val 


Asn 


Ser 


Gin 


Glu 


Gin 


Lys 


Arg 


Leu 












85 










90 








Q tr 




25 


Leu 


Met 


Asp 


Leu 


Asp 


He 


Ser 


Met 


Arg 


Thr 


Val 


Asp 


Cys 


Pro 


Phe 


Thr 








100 










105 






110 








Val 


Thr 


Phe 


Tyr 


Gly 


Ala 


Leu 


Phe 


Arg 


Glu 


Gly 


Asp 


Val 


Trp 


He 


Cys 








115 










120 










125 






Met 


Glu 


Leu 


Met 


Asp 


Thr 


Ser 


Leu 


Asp 


Lys 


Phe 


Tyr 


Lys 


Gin 


Val 


He 






130 










135 










140 








30 


Asp 


Lys 


Gly Gin 


Thr 


He 


Pro 


Glu 


Asp 


He 


Leu 


Gly 


Lys 


He 


Ala 


Val 




145 










150 










155 






160 




Ser 


He 


Val 


Lys 


Ala 


Leu 


Glu 


His 


Leu 


His 


Ser 


Lys 


Leu 


Ser 


Val 


He 




His 








165 










170 








175 




35 


Arg 


Asp 


Val 


Lys 


Pro 


Ser 


Asn 


Val 


Leu 


He 


Asn 


Ala 


Leu 


Gly 


Gin 








180 










185 










190 






Val 


Lys 


Met 


Cys 


Asp 


Phe 


Gly 


He 


Ser 


Gly Tyr 


Leu 


Val 


Asp 


Ser 


Val 








195 










200 










205 








Ala 


Lys 


Thr 


He 


Asp 


Ala 


Gly 


Cys 


Lys 


Pro 


Tyr 


Met 


Ala 


Pro 


Glu 


Arg 






210 










215 










220 








40 


He 


Asn 


Pro 


Glu 


Leu 


Asn 


Gin 


Lys 


Gly 


Tyr 


Ser 


Val 


Lys 


Ser 


Asp 


He 




225 










230 










235 






240 




Trp 


Ser 


Leu 


Gly 


He 


Thr 


Met 


He 


Glu 


Leu 


Ala 


He 


Leu 


Arg 


Phe 


Pro 












245 










250 








255 




45 


Tyr 


Asp 


Ser 


Trp 


Gly 


Thr 


Pro 


Phe 


Gin 


Gin 


Leu 


Lys 


Gin 


Val 


Val 


Glu 








260 










265 








270 








Glu 


Pro 


Ser 
275 


Pro 


Gin 


Leu 


Pro 


Ala 
280 


Asp 


Lys 


Phe 


Ser 


Ala 
285 


Glu 


Phe 


Val 




Asp 


Phe 


Thr 


Ser 


Gin 


Cys 


Leu 


Lys 


Lys 


Asn 


Ser 


Lys 


Glu 


Arg 


Pro 


Thr 






290 










295 










300 








50 


Tyr 


Pro 


Glu 


Leu 


Met 


Gin 


His 


Pro 


Phe 


Phe 


Thr 


Leu 


His 


Glu 


Ser 


Lys 




305 










310 










315 










320 




Gly Thr Asp Val 


Ala 


Ser 


Phe 


Val 


Lys 


Leu 


He 


Leu 


Gly 


Asp 





325 330 



(2) INFORMATION FOR SEQ ID NO : 5 : 



55 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3498 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

5 (ix) FEATURE: 

(A) NAME/ KEY : Coding Sequence 

(B) LOCATION: 4 0... 112 8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

CTAGGGTCCC CGGCGCCAGG CCACCCGGCC GTCAGCAGC ATG CAG GGT AAA CGC 54 
10 Met Gin Gly Lys Arg 

1 5 

AAA GCA CTG AAG TTG AAT TTT GCA AAT CCA CCT TTC AAA TCT ACA GCA 10 2 

Lys Ala Leu Lys Leu Asn Phe Ala Asn Pro Pro Phe Lys Ser Thr Ala 
10 15 ~ 20 

15 AGG TTT ACT CTG AAT CCC AAT CCT ACA GGA GTT CAA AAC CCA CAC ATA 15 0 

Arg Phe Thr Leu Asn Pro Asn Pro Thr Gly Val Gin Asn Pro His lie 
25 30 35 

GAG AGA CTG AGA ACA CAC AGC ATT GAG TCA TCA GGA AAA CTG AAG ATC 198 
Glu Arg Leu Arg Thr His Ser lie Glu Ser Ser Gly Lys Leu Lys lie 
20 40 45 50 

TCC CCT GAA CAA CAC TGG GAT TTC ACT GCA GAG GAC TTG AAA GAC CTT 24 6 

Ser Pro Glu Gin His Trp Asp Phe Thr Ala Glu Asp Leu Lys Asp Leu 
55 60 65 

GGA GAA ATT GGA CGA GGA GCT TAT GGT TCT GTC AAC AAA ATG GTC CAC 2 94 

25 Gly Glu lie Gly Arg Gly Ala Tyr Gly Ser Val Asn Lys Met Val His 

70 75 80 85 

AAA CCA AGT GGG CAA ATA ATG GCA GTT AAA AGA ATT CGG TCA ACA GTG 342 
Lys Pro Ser Gly Gin lie Met Ala Val Lys Arg lie Arg Ser Thr Val 
90 95 " 100 

3 0 GAT GAA AAA GAA CAA AAA CAA CTT CTT ATG GAT TTG GAT GTA GTA ATG 3 90 

Asp Glu Lys Glu Gin Lys Gin Leu Leu Met Asp Leu Asp Val Val Met 
105 110 115 

CGG AGT AGT GAT TGC CCA TAC ATT GTT CAG TTT TAT GGT GCA CTC TTC 43 8 

3 5 Arg Ser Ser Asp Cys Pro Tyr lie Val Gin Phe Tyr Gly Ala Leu Phe 

120 125 130 

AGA GAG GGT GAC TGT TGG ATC TGT ATG GAA CTC ATG TCT ACC TCG TTT 4 86 

Arg Glu Gly Asp Cys Trp lie Cys Met Glu Leu Met Ser Thr Ser Phe 
135 140 145 

4 0 GAT AAG TTT TAC AAA TAT GTA TAT AGT GTA TTA GAT GAT GTT ATT CCA 53 4 

Asp Lys Phe Tyr Lys Tyr Val Tyr Ser Val Leu Asp Asp Val lie Pro 
150 155 160 ^ ~ 165 

GAA GAA ATT TTA GGC AAA ATC ACT TTA GCA ACT GTG AAA GCA CTA AAC 5 82 

Glu Glu lie Leu Gly Lys lie Thr Leu Ala Thr Val Lys Ala Leu Asn 
45 170 175 180 

CAC TTA AAA GAA AAC TTG AAA ATT ATT CAC AGA GAT ATC AAA CCT TCC 63 0 

His Leu Lys Glu Asn Leu Lys lie lie His Arg Asp He Lys Pro Ser 
185 190 195 
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AAT ATT CTT CTG GAC AGA AGT GGA AAT ATT AAG CTC TGT GAC TTC GGC 67 8 

Asn lie Leu Leu Asp Arg Ser Gly Asn lie Lys Leu Cys Asp Phe Gly 
200 205 210 

ATC AGT GGA CAG CTT GTG GAC TCT ATT GCC AAG ACA AGA GAT GCT GGC 72 6 

5 lie Ser Gly Gin Leu Val Asp Ser lie Ala Lys Thr Arg Asp Ala Gly 

215 220 225 

TGT AGG CCA TAC ATG GCA CCT GAA AGA ATA GAC CCA AGC GCA TCA CGA 774 
Cys Arg Pro Tyr Met Ala Pro Glu Arg lie Asp Pro Ser Ala Ser Arg 
230 235 240 245 

10 CAA GGA TAT GAT GTC CGC TCT GAT GTC TGG AGT TTG GGG ATC ACA TTG 822 

Gin Gly Tyr Asp Val Arg Ser Asp Val Trp Ser Leu Gly lie Thr Leu 
250 255 260 

TAT GAG TTG GCC ACA GGC CGA TTT CCT TAT CCA AAG TGG AAT AGT GTA 87 0 

Tyr Glu Leu Ala Thr Gly Arg Phe Pro Tyr Pro Lys Trp Asn Ser Val 
15 265 270 ~ 275 

TTT GAT CAA CTA ACA CAA GTC GTG AAA GGA GAT CCT CCG CAG CTG AGT 918 
Phe Asp Gin Leu Thr Gin Val Val Lys Gly Asp Pro Pro Gin Leu Ser 
280 285 290 

AAT TCT GAG GAA AGG GAA TTC TCC CCG AGT TTC ATC AAC TTT GTC AAC 96 6 

2 0 Asn Ser Glu Glu Arg Glu Phe Ser Pro Ser Phe lie Asn Phe Val Asn 

295 300 305 

TTG TGC CTT ACG AAG GAT GAA TCC AAA AGG CCA AAG TAT AAA GAG CTT 1014 
Leu Cys Leu Thr Lys Asp Glu Ser Lys Arg Pro Lys Tyr Lys Glu Leu 
310 315 320 ~ ' 325 

25 CTG AAA CAT CCC TTT ATT TTG ATG TAT GAA GAA CGT GCC GTT GAG GTC 1062 

Leu Lys His Pro Phe lie Leu Met Tyr Glu Glu Arg Ala Val Glu Val 
330 335 ' 340 

GCA TGC TAT GTT TGT AAA ATC CTG GAT CAA ATG CCA GCT ACT CCC AGC 1110 
Ala Cys Tyr Val Cys Lys He Leu Asp Gin Met Pro Ala Thr Pro Ser 
30 345 350 355 

TCT CCC ATG TAT GTC GAT TGATATCGYT GCTACATCAG ACTCTAGAAA AAAGGGCT 116 6 
Ser Pro Met Tyr Val Asp 
360 

GAGAGGAAGC AAGACGTAAA GAATTTTCAT CCCGTATCAC AGTGTTTTTA TTGCTCGCCC 122 6 

AGAC AC C ATG TGCAATAAGA TTGGTGTTCG TTTCCATCAT GTCTGTATAC TCCTGTCACC 1286 

TAGAACGTGC ATCCTTGTAA TACCTGATTG AT C ACAC AGT GTTAGTGCTG GTCAGAGAGA 134 6 

CCTCATCCTG CTCTTTTGTG ATGAACATAT TCATGAAATG TGGAAGTCAG TACGATCAAG 14 06 

TTGTTGACTG TGATTAGATC ACATCTTAAA TTCATTTCTA GACTCAAAAC CTGGAGATGC 1466 

AGCTACTGGA ATGGTGTTTT GTCAGACTTC CAAATCCTGG AAGGACACAG TGATGAATGT 152 6 

ACTATATCTG AACATAGAAA CTCGGGCTTG AGTGAGAAGA GCTTGCACAG CCAACGAGAC 15 86 

ACATTGCCTT CTGGAGCTGG GAGACAAAGG AGGAATTTAC TTTCTTCACC AAGTGCAATA 164 6 

GATTACTGAT GTGATATTCT GTTGCTTTAC AGTTACAGTT GATGTTTGGG GATCGATGTG 17 0 6 

CTCAGCCAAA TTTCCTGTTT GAAATATCAT GTTAAATTAG AATGAATTTA TCTTTACCAA 1766 

AAAC C ATGTT GCGTTCAAAG AGGTGAACAT TAAAATATAG AGACAGGACA GAATGTGTTC 1826 

TTTTCTCCTC TACCAGTCCT ATTTTTCAAT GGGAAGACTC AGGAGTCTGC CACTTGTCAA 1886 

AGAAGGTGCT GATCCTAAGA ATTTTTCATT CTCAGAATTC GGTGTGCTGC CAACTTGATG 1946 

TTCCACCTGC CACAAACCAC CAGGACTGAA AGAAGAAAAC AGTACAGAAG GCAAAGTTTA 2 006 

CAGATGTTTT TAATTCTAGT ATTTTATCTG GAACAACTTG TAGCAGCTAT ATATTTCCCC 2066 

TTGGTCCCAA GCCTGATACT TTAGCCATCA TAACTCACTA ACAGGGAGAA GTAGC TAGTA 2126 

GCAATGTGCC TTGATTGATT AGATAAAGAT TTCTAGTAGG CAGCAAAAGA CCAAATCTCA 2186 

GTTGTTTGCT TCTTGCCATC ACTGGTCCAG GTCTTCAGTT TCCGAATCTC TTTCCCTTCC 2 24 6 

CCTGTGGTCT ATTGTCGCTA TGTGACTTGC GCTTAATCCA ATATTTTGCC TTTTTTCTAT 2 3 06 

ATCAAAAAAC CTTTACAGTT AGCAGGGATG TTCCTTACCG AGGATTTTTA ACCCCCAATC 23 66 

TCTCATAATC GCTAGTGTTT AAAAGGCTAA GAATAGTGGG GCCCAACCGA TGTGGTAGGT 242 6 

GATAAAGAGG CATCTTTTCT AGAGACACAT TGGACCAGAT GAGGAT C CGA AACGGCAGCC 24 86 

TTTACGTTCA TCACCTGCTA GAACCTCTCG TAGTCCATCA CCATTTCTTG GCATTGGAAT 2 54 6 



40 



45 



50 
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TCTACTGGAA AAAAATACAA AAAGCAAAAC AAAACCCTCA GCACTGTTAC AAGAGGC CAT 26 06 

TTAAGTATCT TGTGCTTCTT CACTTACCCA TTAGCCAGGT TCTCATTAGG TTTTGCTTGG 2 6 66 

GCCTCCCTGG CACTGAACCT TAGGCTTTGT ATGACAGTGA AGCAGCACTG TGAGTGGTTC 2 72 6 

AAGCACACTG GAATATAAAA CAGTCATGGC CTGAGATGCA GGTGATGCCA TTAC AGAAC C 27 86 

5 AAATCGTGGC ACGTATTGCT GTGTCTCCTC TCAGAGTGAC AGTCATAAAT ACTGTCAAAC 2 846 

AATAAAGGGA GAATGGTGCT GTTTAAAGTC ACATCCCTGT AAATTGCAGA ATTCAAAAGT 2 9 06 

GATTATCTCT TTGATCTACT TGCCTCATTT CCCTATCTTC TCCCCCACGG TATCCTAAAC 2 966 

TTTAGACTTC CCACTGTTCT GAAAGGAGAC ATTGCTCTAT GTCTGCCTTC GAC CACAGCA 3 026 

AGCCATCATC CTCCATTGCT CCCGGGGACT CAAGAGGAAT CTGTTTCTCT GCTGTCAACT 3 0 86 

10 TCCCATCTGG CTCAGCATAG GGTCACTTTG CCATTATGCA AATGGAGATA AAAGCAATTC 314 6 

TGGCTGTCCA GGAGCTAATC TGACCGTTCT ATTGTGTGGA TGACCACATA AGAAGGCAAT 32 06 

TTTAGTGTAT TAATCATAGA TTATTATAAA CTATAAACTT AAGGGCAAGG AGTTTATTAC 32 66 

AATGTATCTT TATTAAAACA AAAGGGTGTA TAGTGTTCAC AAACTGTGAA AATAGTGTAA 3 326 

GAACTGTACA TTGTGAGCTC TGGTTATTTT TCTCTTGTAC CATAGAAAAA TGTATAAAAA 3 3 86 

15 TTATCAAAAA GCTAATGTGC AGGGATATTG CCTTATTTGT CTGTAAAAAA TGGAGCTCAG 344 6 

TAACATAACT GCTTCTTGGA GCTTTGGAAT ATTTTATCCT GTATTCTTGT TT 34 98 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 63 amino acids 

20 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



25 



30 



35 



40 



45 



50 



55 



60 





(xi) : 


SEQUENCE 


DESCRIPTION: SEQ ID 


NO: 


6 : 










Met 


Gin 


Gly 


Lys 


Arg 


Lys 


Ala 


Leu Lys 


Leu 


Asn 


Phe 


Ala 


Asn 


Pro 


Pro 


1 








5 








10 










15 




Phe 


Lys 


Ser 


Thr 


Ala 


Arg 


Phe 


Thr Leu 


Asn 


Pro 


Asn 


Pro 


Thr Gly Val 








20 








25 










30 






Gin 


Asn 


Pro 


His 


He 


Glu 


Arg 


Leu Arg 


Thr 


His 


Ser 


He 


Glu 


Ser 


Ser 






35 










40 








45 








Gly 


Lys 


Leu 


Lys 


He 


Ser 


Pro 


Glu Gin 


His 


Trp 


Asp 


Phe 


Thr 


Ala 


Glu 




50 










55 






60 










Asp 


Leu 


Lys 


Asp 


Leu 


Gly 


Glu 


He Gly 


Arg 


Gly 


Ala 


Tyr 


Gly 


Ser 


Val 


65 










70 








75 








80 


Asn 


Lys 


Met 


Val 


His 


Lys 


Pro 


Ser Gly 


Gin 


He 


Met 


Ala 


Val 


Lys 


Arg 










85 








90 










95 


He 


Arg 


Ser 


Thr 


Val 


Asp 


Glu 


Lys Glu 


Gin 


Lys 


Gin 


Leu 


Leu 


Met 


Asp 








100 








105 










110 




Leu 


Asp 


Val 


Val 


Met 


Arg 


Ser 


Ser Asp 


Cys 


Pro 


Tyr 


He 


Val 


Gin 


Phe 






115 










12 0 








125 








Tyr 


Gly 


Ala 


Leu 


Phe 


Arg 


Glu 


Gly Asp 


Cys 


Trp 


He 


Cys 


Met 


Glu 


Leu 




130 










135 








140 








Met 


Ser 


Thr 


Ser 


Phe 


Asp 


Lys 


Phe Tyr 


Lys 


Tyr 


Val 


Tyr 


Ser 


Val 


Leu 


145 










150 








155 








160 


Asp 


Asp 


Val 


He 


Pro 


Glu 


Glu 


He Leu 


Gly 


Lys 


He 


Thr 


Leu 


Ala 


Thr 










165 








170 










175 




Val 


Lys 


Ala 


Leu 


Asn 


His 


Leu 


Lys Glu 


Asn 


Leu 


Lys 


He 


He 


His 


Arg 








180 








185 










190 




Asp 


He 


Lys 


Pro 


Ser 


Asn 


He 


Leu Leu 


Asp 


Arg 


Ser 


Gly 


Asn 


He 


Lys 






195 










200 








205 






Leu 


Cys 


Asp 


Phe 


Gly 


He 


Ser 


Gly Gin 


Leu 


Val 


Asp 


Ser 


He 


Ala 


Lys 




210 










215 








220 








Thr 


Arg 


Asp 


Ala 


Gly 


Cys 


Arg 


Pro Tyr 


Met 


Ala 


Pro 


Glu 


Arg 


He 


Asp 


225 










230 








235 








240 


Pro 


Ser 


Ala 


Ser 


Arg 


Gin 


Gly 


Tyr Asp 


Val 


Arg 


Ser 


Asp 


Val 


Trp 


Ser 










245 








250 








255 




Leu 


Gly 


He 


Thr 


Leu 


Tyr 


Glu 


Leu Ala 


Thr 


Gly Arg 


Phe 


Pro 


Tyr 


Pro 








260 








265 










270 




Lys 


Trp 


Asn 


Ser 


Val 


Phe 


Asp 


Gin Leu 


Thr 


Gin 


Val 


Val 


Lys 


Gly Asp 






275 










280 








285 
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5 



Pro 


Pro 


Gin 


Leu 


Ser 


Asn 


Ser 


Glu 


Glu 


Arg 


Glu 


Phe 


Ser 


Pro 


Ser 


Phe 




290 










295 








300 










He 


Asn 


Phe 


Val 


Asn 


Leu 


Cys 


Leu 


Thr 


Lys 


Asp 


Glu 


Ser 


Lys 


Arg 


Pro 


305 










310 










315 






320 


Lys 


Tyr 


Lys 


Glu 


Leu 


Leu 


Lys 


His 


Pro 


Phe 


He 


Leu 


Met 


Tyr 


Glu 


Glu 










325 










330 








335 




Arg 


Ala 


Val 


Glu 
340 


Val 


Ala 


Cys 


Tyr 


Val 
345 


Cys 


Lys 


He 


Leu 


Asp 
350 


Gin 


Met 


Pro 


Ala 


Thr 


Pro 


Ser 


Ser 


Pro 


Met 


Tyr 


Val 


Asp 













10 355 360 



(2) INFORMATION FOR SEQ ID NO : 7 ; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 54 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME /KEY : Coding Sequence 
20 (B) LOCATION: 6 . . . 1184 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

CAACA ATG GCG GCT CCG AGC CCG AGC GGT GGC GGC GGC AGC GGC ACC CCC 5 0 

Met Ala Ala Pro Ser Pro Ser Gly Gly Gly Gly Ser Gly Thr Pro 
15 10 ' 15 

25 GGC CCC GTA GGG TCC CCG GCG CCA GGC CAC CCG GCC GTC AGC AGC ATG 9 8 

Gly Pro Val Gly Ser Pro Ala Pro Gly His Pro Ala Val Ser Ser Met 
20 25 30 

CAG GGT AAA CGC AAA GCA CTG AAG TTG AAT TTT GCA AAT CCA CCT TTC 14 6 

3 0 Gin Gly Lys Arg Lys Ala Leu Lys Leu Asn Phe Ala Asn Pro Pro Phe 

35 40 45 

AAA TCT ACA GCA AGG TTT ACT CTG AAT CCC AAT CCT ACA GGA GTT CAA 194 

Lys Ser Thr Ala Arg Phe Thr Leu Asn Pro Asn Pro Thr Gly Val Gin 

5 0 55 60 

35 

AAC CCA CAC ATA GAG AGA CTG AGA ACA CAC AGC ATT GAG TCA TCA GGA 242 

Asn Pro His He Glu Arg Leu Arg Thr His Ser He Glu Ser Ser Gly 

65 70 75 

AAA CTG AAG ATC TCC CCT GAA CAA CAC TGG GAT TTC ACT GCA GAG GAC 2 90 

Lys Leu Lys He Ser Pro Glu Gin His Trp Asp Phe Thr Ala Glu Asp 
80 85 90 95 

TTG AAA GAC CTT GGA GAA ATT GGA CGA GGA GCT TAT GGT TCT GTC AAC 33 8 

Leu Lys Asp Leu Gly Glu He Gly Arg Gly Ala Tyr Gly Ser Val Asn 
100 105 110 

45 AAA ATG GTC CAC AAA CCA AGT GGG CAA ATA ATG GCA GTT AAA AGA ATT 3 86 

Lys Met Val His Lys Pro Ser Gly Gin He Met Ala Val Lys Arg He 
115 120 125 

CGG TCA ACA GTG GAT GAA AAA GAA CAA AAA CAA CTT CTT ATG GAT TTG 434 
Arg Ser Thr Val Asp Glu Lys Glu Gin Lys Gin Leu Leu Met Asp Leu 
50 130 135 140 



40 
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GAT GTA GTA ATG CGG AGT AGT GAT TGC CCA TAC ATT GTT CAG TTT TAT 4 82 

Asp Val Val Met Arg Ser Ser Asp Cys Pro Tyr lie Val Gin Phe Tyr 
145 150 155 

GGT GCA CTC TTC AGA GAG GGT GAC TGT TGG ATC TGT ATG GAA CTC ATG 53 0 

5 Gly Ala Leu Phe Arg Glu Gly Asp Cys Trp lie Cys Met Glu Leu Met 

160 165 170 175 

TCT ACC TCG TTT GAT AAG TTT TAC AAA TAT GTA TAT AGT GTA TTA GAT 57 8 

Ser Thr Ser Phe Asp Lys Phe Tyr Lys Tyr Val Tyr Ser Val Leu Asp 
180 185 190 

10 GAT GTT ATT CCA GAA GAA ATT TTA GGC AAA ATC ACT TTA GCA ACT GTG 62 6 

Asp Val He Pro Glu Glu He Leu Gly Lys He Thr Leu Ala Thr Val 
195 200 205 

AAA GCA CTA AAC CAC TTA AAA GAA AAC TTG AAA ATT ATT CAC AGA GAT 674 
Lys Ala Leu Asn His Leu Lys Glu Asn Leu Lys He He His Arg Asp 
15 210 215 " 220 

ATC AAA CCT TCC AAT ATT CTT CTG GAC AGA AGT GGA AAT ATT AAG CTC 72 2 

He Lys Pro Ser Asn He Leu Leu Asp Arg Ser Gly Asn He Lys Leu 
225 230 235 



20 



TGT GAC TTC GGC ATC AGT GGA CAG CTT GTG GAC TCT ATT GCC AAG ACA 77 0 

Cys Asp Phe Gly He Ser Gly Gin Leu Val Asp Ser He Ala Lys Thr 
240 245 250 255 

AGA GAT GCT GGC TGT AGG CCA TAC ATG GCA CCT GAA AGA ATA GAC CCA 818 
Arg Asp Ala Gly Cys Arg Pro Tyr Met Ala Pro Glu Arg He Asp Pro 
260 265 270 

2 5 AGC GCA TCA CGA CAA GGA TAT GAT GTC CGC TCT GAT GTC TGG AGT TTG 8 66 

Ser Ala Ser Arg Gin Gly Tyr Asp Val Arg Ser Asp Val Trp Ser Leu 
275 280 285 

GGG ATC ACA TTG TAT GAG TTG GCC ACA GGC CGA TTT CCT TAT CCA AAG 914 
Gly He Thr Leu Tyr Glu Leu Ala Thr Gly Arg Phe Pro Tyr Pro Lys 
30 290 295 300 

TGG AAT AGT GTA TTT GAT CAA CTA ACA CAA GTC GTG AAA GGA GAT CCT 962 
Trp Asn Ser Val Phe Asp Gin Leu Thr Gin Val Val Lys Gly Asp Pro 
305 310 315 

CCG CAG CTG AGT AAT TCT GAG GAA AGG GAA TTC TCC CCG AGT TTC ATC 1010 
35 Pro Gin Leu Ser Asn Ser Glu Glu Arg Glu Phe Ser Pro Ser Phe He 

320 325 330 335 

AAC TTT GTC AAC TTG TGC CTT ACG AAG GAT GAA TCC AAA AGG CCA AAG 105 8 

Asn Phe Val Asn Leu Cys Leu Thr Lys Asp Glu Ser Lys Arg Pro Lys 
340 345 350 

4 0 TAT AAA GAG CTT CTG AAA CAT CCC TTT ATT TTG ATG TAT GAA GAA CGT 110 6 

Tyr Lys Glu Leu Leu Lys His Pro Phe He Leu Met Tyr Glu Glu Arg 
355 360 365 

GCC GTT GAG GTC GCA TGC TAT GTT TGT AAA ATC CTG GAT CAA ATG CCA 1154 
Ala Val Glu Val Ala Cys Tyr Val Cys Lys He Leu Asp Gin Met Pro 
45 370 375 380 

GCT ACT CCC AGC TCT CCC ATG TAT GTC GAT TGATATCGYT GCTACATCAG ACT 12 07 
Ala Thr Pro Ser Ser Pro Met Tyr Val Asp 
385 390 

CTAGAAAAAA GGGCTGAGAG GAAGCAAGAC GTAAAGAATT TTCATCCCGT ATCACAGTGT 1267 
50 TTTTATTGCT CGCCCAGACA CCATGTGCAA TAAGATTGGT GTTCGTTTCC ATCATGTCTG 13 2 7 

TATACTCCTG TCACCTAGAA CGTGCATCCT TGT AAT AC C T GATTGATCAC ACAGTGTTAG 13 8 7 



WO 99/02547 



PCT/US98/14101 



TGCTGGTCAG AGAGACCTCA TCCTGCTCTT 
GTCAGTACGA TCAAGTTGTT GACTGTGATT 
AAAACCTGGA GATGCAGCTA CTGGAATGGT 
CACAGTGATG AATGTACTAT ATCTGAACAT 
5 CACAGCCAAC GAGACACATT GCCTTCTGGA 

TCACCAAGTG CAATAGATTA CTGATGTGAT 
TTGGGGATCG ATGTGCTCAG CCAAATTTCC 
ATTTATCTTT ACCAAAAACC ATGTTGCGTT 
GGACAGAATG TGTTCTTTTC TCCTCTACCA 

10 TCTGCCACTT GTCAAAGAAG GTGCTGATCC 

GCTGCCAACT TGATGTTCCA CCTGCCACAA 
AGAAGG C AAA GTTTACAGAT GTTTTTAATT 
GCTATATATT TCCCCTTGGT CCCAAGCCTG 
GAGAAGTAGC TAGTAGCAAT GTGCCTTGAT 

15 AAAGAC C AAA TCTCAGTTGT TTGCTTCTTG 

ATCTCTTTCC CTTCCCCTGT GGTCTATTGT 
TTGCCTTTTT TCTATATCAA AAAACCTTTA 
TTTTAACCCC CAATC TCTCA TAATCGCTAG 
ACCGATGTGG TAGGTGATAA AGAGGCATCT 

2 0 TCCGAAACGG CAGCCTTTAC GTTCATCACC 

TCTTGGCATT GGAATTCTAC TGGAAAAAAA 
GTTACAAGAG GCCATTTAAG TATCTTGTGC 
TTAGGTTTTG CTTGGGCCTC CCTGGCACTG 
CACTGTGAGT GGTTCAAGCA CACTGGAATA 

2 5 TG C C ATT AC A GAACCAAATC GTGGCACGTA 

TAAATACTGT CAAACAATAA AGGGAGAATG 
GCAGAATTCA AAAGTGATTA TCTCTTTGAT 
CACGGTATCC TAAACTTTAG ACTTCCCACT 
CCTTCGACCA CAGCAAGCCA TCATCCTCCA 

3 0 TCTCTGCTGT CAACTTCCCA TCTGGCTCAG 

AG AT AAAAG C AATTCTGGCT GTCCAGGAGC 
ACATAAGAAG GCAATTTTAG TGTATTAATC 
CAAGGAGTTT ATTACAATGT ATCTTTATTA 
GTGAAAATAG TGTAAGAACT GTACATTGTG 
35 AAAAATGTAT AAAAATTATC AAAAAGCTAA 

AAAAATGGAG CTCAGTAACA TAACTGCTTC 
CTTGTTT 



12 



TTGTGATGAA CATATTCATG AAATGTGGAA 144 7 

AGATCACATC TTAAATTCAT TTCTAGACTC 15 07 

GTTTTGTCAG ACTTCCAAAT CCTGGAAGGA 1567 

AGAAACTCGG GCTTGAGTGA GAAGAGCTTG 162 7 

GCTGGGAGAC AAAGGAGGAA TTTACTTTCT 1687 

ATTCTGTTGC TTTACAGTTA CAGTTGATGT 174 7 

TGTTTGAAAT ATCATGTTAA ATTAGAATGA 180 7 

CAAAGAGGTG AACATTAAAA TATAGAGACA 1867 

GTCCTATTTT TCAATGGGAA GACT CAGGAG 192 7 

TAAGAATTTT TCATTCTCAG AATTCGGTGT 1987 

AC C AC C AGGA CTGAAAGAAG AAAACAGTAC 2 04 7 

CTAGTATTTT ATCTGGAACA ACTTGTAGCA 2107 

ATACTTTAGC CATCATAACT CACTAACAGG 2167 

TGATTAGATA AAGATTTCTA GTAGGCAGCA 222 7 

CCATCACTGG TCCAGGTCTT CAGTTTCCGA 2 2 87 

CGCTATGTGA CTTGCGCTTA ATCCAATATT 2 34 7 

CAGTTAGCAG GGATGTTCCT TACCGAGGAT 2 4 07 

TGTTTAAAAG GCTAAGAATA GTGGGGCCCA 24 67 

TTTCTAGAGA CACATTGGAC CAGATGAGGA 2 52 7 

TGCTAGAACC TCTCGTAGTC CATCACCATT 2 587 

TACAAAAAGC AAAACAAAAC CCTCAGCACT 2 64 7 

TTCTTCACTT ACCCATTAGC CAGGTTCTCA 2 7 07 

AACCTTAGGC TTTGTATGAC AGTGAAGCAG 2 7 67 

TAAAACAGTC ATGGCCTGAG ATGCAGGTGA 2 82 7 

TTGCTGTGTC TCCTCTCAGA GTGACAGTCA 2 8 87 

GTGCTGTTTA AAGTCACATC CCTGTAAATT 2 947 

CTACTTGCCT CATTTCCCTA TCTTCTCCCC 3 0 07 

GTTCTGAAAG GAGACATTGC TCTATGTCTG 3 0 67 

TTGCTCCCGG GGACTCAAGA GGAATCTGTT 312 7 

CATAGGGTCA CTTTGCCATT ATGCAAATGG 3187 

TAATCTGACC GTTCTATTGT GTGGATG AC C 3247 

ATAGATTATT ATAAACTATA AACTTAAGGG 3 3 07 

AAACAAAAGG GTGTATAGTG TTCACAAACT 33 67 

AGCTCTGGTT ATTTTTCTCT TGTACCATAG 34 2 7 

TGTGCAGGGA TATTGCCTTA TTTGTCTGTA 34 87 

TTGGAGCTTT GGAATATTTT ATCCTGTATT 3 54 7 

3554 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 
4 0 (A) LENGTH: 3 93 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 



50 



55 



Met 


Ala 


Ala 


Pro 


Ser 


Pro 


Ser 


Gly Gly 


Gly 


Gly 


Ser Gly Thr Pro 


Gly 


1 








5 








10 




15 


Pro 


Val 


Gly 


Ser 


Pro 


Ala 


Pro 


Gly His 


Pro 


Ala 


Val Ser Ser Met 


Gin 








20 








25 






30 




Gly 


Lys 


Arg 


Lys 


Ala 


Leu 


Lys 


Leu Asn 


Phe 


Ala 


Asn Pro Pro Phe 


Lys 






35 










40 






45 


Ser 


Thr 


Ala 


Arg 


Phe 


Thr 


Leu 


Asn Pro 


Asn 


Pro 


Thr Gly Val Gin 


Asn 




50 










55 








60 




Pro 


His 


He 


Glu 


Arg 


Leu 


Arg 


Thr His 


Ser 


He 


Glu Ser Ser Gly 


Lys 


65 










70 








75 


80 


Leu 


Lys 


He 


Ser 


Pro 


Glu 


Gin 


His Trp 


Asp 


Phe 


Thr Ala Glu Asp 


Leu 










85 








90 




95 




Lys 


Asp 


Leu 


Gly 


Glu 


He 


Gly 


Arg Gly 


Ala 


Tyr 


Gly Ser Val Asn 


Lys 








100 








105 






110 
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Met 


Val 


His 


Lys 


Pro 


C? fia -v~ 
OCX 




vjin lie 


1*1 C2 L. 


i-ij. a 


vajL 


Lys 


Arg 


lie 


Arg 








115 










120 








X ^ -J 






Ser 


Thr 


Val 


Asp 


Glu 


Lys 


Glu 


vjj.1i ¥ o 


dl ri 

OXll 


Leu 


Leu 


i v ie c 


Asp 


Leu 


Asp 






130 










135 








140 






5 


Val 


Val 


Met 


Arcr 


Ser 


Ser 


Asp 


fve; Pro 


±yr 


lie 


V ct _L 


m n 

uin 


ir ne 


Tyr 


oly 




14 5 










150 








155 








X D U 




Ala 


Leu 


Phe 


Arcr 


Glu 
165 


Glv 


Asp 


Cys Trp 


He 
170 


Cys 


Met 


Glu 


Xj t: LI 


175 


Ser 


10 


Thr 


Ser 


Phe 


Asp 


Lvs 


Phe 


Tvr 


Lys Tyr 


Val 


Tyr 


Ser 


Val 


Leu 


Asp 


Asp 








180 








185 










190 




Val 


He 


Pro 


Glu 


Glu 


He 


Leu 


Gly Lys 


He 


Thr 


Leu 


Ala 


X IJ.X 


V ctX 


Lys 








195 










200 








205 








Ala 


Leu 


Asn 


His 


Leu 


Lys 


Glu 




Lys 


He 


He 


XIX io 


Arg 


Asp 


lie 






210 










215 








220 






15 


Lvs 


Pro 


Ser 


Asn 


He 


Leu 


Leu 




Ser 


Gly Asn 


Tip 


ijy s 


Leu 


Cys 




225 










230 








235 








Z ft u 




Asp 


Phe 


Glv 


He 


Ser 


Glv 


Gin 


Leu Val 


Asp 


Ser 


He 


J-iX d 


Lys 


1 ill 


Arg 












245 








250 








255 


20 


Asp 


Ala 


Gly 


Cvs 


Arcr 


Pro 


Tvr 


Met Ala 


Pro 


Glu 


Arg 


Tip 
X X c: 


ribp 


Pro 


Ser 








260 








265 








270 








Ala 


Ser 


Arg 


Gin 


Glv 


Tvr 
j 


Asp 


Val Arg 


Ser 


Asp 


Val 


T , T~r-\ 

xrp 


Ser 


Leu 


f~*l i r 








275 










2 80 








285 








He 


Thr 


Leu 


Tvr 


Glu 


Leu 


Ala 


Thr Gly 


Arg 


Phe 


Pro 


±yr 


rl L) 


Lys 


Trp 






290 










295 








300 






25 


Asn 


Ser 


Val 


Phe 


Asp 


Gin 


Leu 


Thr Gin 


Val 


Val 


Lys 


uiy 


Asp 


Pro 


Pro 




305 










310 








315 




*3 O A 




Gin 


Leu 


Ser 


Asn 


Ser 
325 


Glu 


Glu 




Phe 
330 


Ser 


Pro 


Cot- 
OCX 


ir ne 


xj_e 
335 


Asn 


30 


Phe 


Val 


Asn 


Leu 


Cys 


Leu 


Thr 


Lys Asp 


Glu 


Ser 


Lys 


Arg 


Pro 


Lys 


Tyr 








340 








345 








350 




Lys 


Glu 


Leu 


Leu 


Lys 


His 


Pro 


Phe He 


Leu 


Met 


Tyr 


Glu 


Glu 


Arg 


Ala 








355 










360 








365 








Val 


Glu 
370 


Val 


Ala 


Cys 


Tyr 


Val 
375 


Cys Lys 


He 


Leu 


Asp 
380 


Gin 


Met 


Pro 


Ala 


35 


Thr 


Pro 


Ser 


Ser 


Pro 


Met 


Tyr 


Val Asp 


















385 










390 

















(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 57 6 base pairs 

40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 
45 (A) NAME /KEY : Coding Sequence 

(B) LOCATION: 10... 1206 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

CTCCCAACA ATG GCG GCT CCG AGC CCG AGC GGC GGC GGC GGC TCC GGG GGC 51 
Met Ala Ala Pro Ser Pro Ser Gly Gly Gly Gly Ser Gly Gly 
50 15 10 

GGC AGC GGC AGC GGC ACC CCC GGC CCC GTA GGG TCC CCG GCG CCA GGC 9 9 

Gly Ser Gly Ser Gly Thr Pro Gly Pro Val Gly Ser Pro Ala Pro Gly 
15 20 25 30 

CAC CCG GCC GTC AGC AGC ATG CAG GGT AAA CGC AAA GCA CTG AAG TTG 14 7 

55 His Pro Ala Val Ser Ser Met Gin Gly Lys Arg Lys Ala Leu Lys Leu 

35 40 45 
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AAT TTT GCA AAT CCA CCT TTC AAA TCT ACA GCA AGG TTT ACT CTG AAT 195 
Asn Phe Ala Asn Pro Pro Phe Lys Ser Thr Ala Arg Phe Thr Leu Asn 
50 55 " 60 

CCC AAT CCT ACA GGA GTT CAA AAC CCA CAC ATA GAG AGA CTG AGA ACA 24 3 

5 Pro Asn Pro Thr Gly Val Gin Asn Pro His lie Glu Arg Leu Arg Thr 

65 70 75 

CAC AGC ATT GAG TCA TCA GGA AAA CTG AAG ATC TCC CCT GAA CAA CAC 2 91 

His Ser lie Glu Ser Ser Gly Lys Leu Lys lie Ser Pro Glu Gin His 
80 85 90 

10 TGG GAT TTC ACT GCA GAG GAC TTG AAA GAC CTT GGA GAA ATT GGA CGA 33 9 

Trp Asp Phe Thr Ala Glu Asp Leu Lys Asp Leu Gly Glu lie Gly Arg 
95 100 105 110 

GGA GCT TAT GGT TCT GTC AAC AAA ATG GTC CAC AAA CCA AGT GGG CAA 3 87 

Gly Ala Tyr Gly Ser Val Asn Lys Met Val His Lys Pro Ser Gly Gin 
15 115 120 " 125 

ATA ATG GCA GTT AAA AGA ATT CGG TCA ACA GTG GAT GAA AAA GAA CAA 43 5 

lie Met Ala Val Lys Arg lie Arg Ser Thr Val Asp Glu Lys Glu Gin 
130 135 140 



20 



AAA CAA CTT CTT ATG GAT TTG GAT GTA GTA ATG CGG AGT AGT GAT TGC 4 83 

Lys Gin Leu Leu Met Asp Leu Asp Val Val Met Arg Ser Ser Asp Cys 
145 150 155 

CCA TAC ATT GTT CAG TTT TAT GGT GCA CTC TTC AGA GAG GGT GAC TGT 531 
Pro Tyr lie Val Gin Phe Tyr Gly Ala Leu Phe Arg Glu Gly Asp Cys 
160 165 170 

25 TGG ATC TGT ATG GAA CTC ATG TCT ACC TCG TTT GAT AAG TTT TAC AAA 5 79 

Trp lie Cys Met Glu Leu Met Ser Thr Ser Phe Asp Lys Phe Tyr Lys 
175 180 185 " ~ 190 

TAT GTA TAT AGT GTA TTA GAT GAT GTT ATT CCA GAA GAA ATT TTA GGC 62 7 

Tyr Val Tyr Ser Val Leu Asp Asp Val He Pro Glu Glu He Leu Gly 
30 195 200 205 

AAA ATC ACT TTA GCA ACT GTG AAA GCA CTA AAC CAC TTA AAA GAA AAC 6 75 

Lys He Thr Leu Ala Thr Val Lys Ala Leu Asn His Leu Lys Glu Asn 
210 215 220 

TTG AAA ATT ATT CAC AGA GAT ATC AAA CCT TCC AAT ATT CTT CTG GAC 723 
35 Leu Lys He He His Arg Asp He Lys Pro Ser Asn He Leu Leu Asp 

225 230 235 

AGA AGT GGA AAT ATT AAG CTC TGT GAC TTC GGC ATC AGT GGA CAG CTT 7 71 

Arg Ser Gly Asn He Lys Leu Cys Asp Phe Gly He Ser Gly Gin Leu 
240 245 250 

4 0 GTG GAC TCT ATT GCC AAG ACA AGA GAT GCT GGC TGT AGG CCA TAC ATG 819 

Val Asp Ser He Ala Lys Thr Arg Asp Ala Gly Cys Arg Pro Tyr Met 
255 260 265 270 

GCA CCT GAA AGA ATA GAC CCA AGC GCA TCA CGA CAA GGA TAT GAT GTC 8 67 

Ala Pro Glu Arg He Asp Pro Ser Ala Ser Arg Gin Gly Tyr Asp Val 
45 275 280 285 

CGC TCT GAT GTC TGG AGT TTG GGG ATC ACA TTG TAT GAG TTG GCC ACA 915 
Arg Ser Asp Val Trp Ser Leu Gly He Thr Leu Tyr Glu Leu Ala Thr 
290 295 300 

GGC CGA TTT CCT TAT CCA AAG TGG AAT AGT GTA TTT GAT CAA CTA ACA 963 

5 0 Gly Arg Phe Pro Tyr Pro Lys Trp Asn Ser Val Phe Asp Gin Leu Thr 

305 310 315 
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15 

CAA GTC GTG AAA GGA GAT CCT CCG CAG CTG AGT AAT TCT GAG GAA AGG 1011 
Gin Val Val Lys Gly Asp Pro Pro Gin Leu Ser Asn Ser Glu Glu Arg 
320 325 330 

GAA TTC TCC CCG AGT TTC ATC AAC TTT GTC AAC TTG TGC CTT ACG AAG 1059 
Glu Phe Ser Pro Ser Phe lie Asn Phe Val Asn Leu Cys Leu Thr Lys 
335 340 345 ~ 350 

GAT GAA TCC AAA AGG CCA AAG TAT AAA GAG CTT CTG AAA CAT CCC TTT 1107 
Asp Glu Ser Lys Arg Pro Lys Tyr Lys Glu Leu Leu Lys His Pro Phe 
355 360 365 

ATT TTG ATG TAT GAA GAA CGT GCC GTT GAG GTC GCA TGC TAT GTT TGT 1155 
lie Leu Met Tyr Glu Glu Arg Ala Val Glu Val Ala Cys Tyr Val Cys 
370 375 380 



AAA ATC CTG GAT CAA ATG CCA GCT ACT CCC AGC TCT CCC ATG TAT GTC 12 03 

Lys lie Leu Asp Gin Met Pro Ala Thr Pro Ser Ser Pro Met Tyr Val 
15 385 390 395 

GAT TGATATCGCT GCTACATCAG ACTCTAGAAA AAAGGGCTGA G AGGAAG CAA GACGTA 12 62 
Asp 



AAGAATTTTC ATCCCGTATC ACAGTGTTTT 
GATTGGTGTT CGTTTCCATC ATGTCTGTAT 

2 0 AATACCTGAT TGATCACACA GTGTTAGTGC 

TGATGAACAT ATTCATGAAA TGTGGAAGTC 
TCACATCTTA AATTCATTTC TAGACTCAAA 
TTGTCAGACT TCCAAATCCT GGAAGGACAC 
AACTCGGGCT TGAGTGAGAA GAGCTTGCAC 
25 GGGAGACAAA GGAGGAATTT ACTTTCTTCA 

CTGTTGCTTT ACAGTTACAG TTGATGTTTG 
TTGAAATATC ATGTTAAATT AGAATGAATT 
AGAGGTGAAC ATTAAAATAT AGAGACAGGA 
CTATTTTTCA ATGGGAAGAC TCAGGAGTCT 

3 0 GAATTTTTCA TTCTCAGAAT TCGGTGTGCT 

ACCAGGACTG AAAGAAGAAA ACAGTACAGA 
GTATTTTATC TGGAACAACT TGTAGCAGCT 
CTTTAGCCAT CATAACTCAC TAACAGGGAG 
TTAGATAAAG ATTTCTAGTA GGCAGCAAAA 

3 5 TCACTGGTCC AGGTCTTCAG TTTCCGAATC 

TATGTGACTT GCGCTTAATC CAATATTTTG 
TTAGCAGGGA TGTTCCTTAC CGAGGATTTT 
TTAAAAGGCT AAGAATAGTG GGGCCCAACC 
CTAGAGACAC ATTGGAC CAG ATGAGGATCC 

4 0 TAGAACCTCT CGTAGTCCAT CACCATTTCT 

AAAAAGCAAA ACAAAACCCT CAGCACTGTT 
TTCACTTACC CATTAGCCAG GTTCT CATTA 
CTTAGGCTTT GTATGACAGT GAAGCAGCAC 
AACAGTCATG GCCTGAGATG CAGGTGATGC 

45 CTGTGTCTCC TCTCAGAGTG AC AGT C ATAA 

CTGTTTAAAG TCACATCCCT GTAAATTGCA 
CTTGCCTCAT TTCCCTATCT TCTCCCCCAC 
C TGAAAGG AG ACATTGCTCT ATGTCTGCCT 
CTCCCGGGGA CTCAAGAGGA ATCTGTTTCT 

50 AGGGTCACTT TGCCATTATG CAAATGGAGA 

TCTGACCGTT CTATTGTGTG GATGACCACA 
GATTATTATA AACTATAAAC TTAAGGGCAA 
CAAAAGGGTG TATAGTGTTC ACAAACTGTG 
TCTGGTTATT TTTCTCTTGT AC C ATAGAAA 

55 GCAGGGATAT TGCCTTATTT GTCTGTAAAA 

GAGCTTTGGA ATATTTTATC CTGTATTCTT 



TATTGCTCGC CCAGACACCA TGTGCAATAA 13 2 2 

ACTCCTGTCA CCTAGAACGT GCATCCTTGT 13 82 

TGGTCAGAGA GACCTCATCC TGCTCTTTTG 1442 

AGTACGATCA AGTTGTTGAC TGTGATTAGA 15 02 

AC C TGGAG AT GCAGCTACTG GAATGGTGTT 15 62 

AGTGATGAAT GTAC TAT AT C TGAACATAGA 1622 

AGC C AAC GAG ACACATTGCC TTCTGGAGCT 16 82 

CCAAGTGCAA TAGATTACTG ATGTGATATT 1742 

GGGATCGATG TGCTCAGCCA AATTTCCTGT 18 02 

TATCTTTACC AAAAAC CATG TTGCGTTCAA 1862 

CAGAATGTGT TCTTTTCTCC TCTACCAGTC 1922 

GCCACTTGTC AAAGAAGGTG CTGATCCTAA 19 82 

GCCAACTTGA TGTTCCACCT GCCACAAACC 2 042 

AGGCAAAGTT TACAGATGTT TTTAATTCTA 2102 

ATATATTTCC CCTTGGTCCC AAGCCTGATA 2162 

AAGTAGCTAG TAGCAATGTG CCTTGATTGA 2 222 

GACCAAATCT CAGTTGTTTG CTTCTTGCCA 22 82 

TCTTTCCCTT CCCCTGTGGT CTATTGTCGC 2 342 

CCTTTTTTCT AT AT C AAAAA ACCTTTACAG 24 02 

TAACCCCCAA TCTCTCATAA TCGCTAGTGT 24 62 

GATGTGGTAG GTGATAAAGA GGCATCTTTT 252 2 

GAAACGGCAG CCTTTACGTT CATCACCTGC 2 5 82 

TGGCATTGGA ATTCTACTGG AAAAAAATAC 2642 

ACAAGAGGCC ATTTAAGTAT CTTGTGCTTC 27 02 

GGTTTTGCTT GGGCCTCCCT GGCACTGAAC 2 7 62 

TGTGAGTGGT TCAAGCACAC TGGAATATAA 2 822 

CATTACAGAA CCAAATCGTG GCACGTATTG 2 8 82 

ATACTGTCAA ACAATAAAGG GAGAATGGTG 2 94 2 

GAATTCAAAA GTGATTATCT CTTTGATCTA 3 002 

GGTATCCTAA ACTTTAGACT TCCCACTGTT 3 0 62 

TCGACCACAG CAAGCCATCA TCCTCCATTG 312 2 

CTGCTGTCAA CTTCCCATCT GGCTCAGCAT 3182 

TAAAAGCAAT TCTGGCTGTC CAGGAGCTAA 3242 

TAAGAAGGCA ATTTTAGTGT ATTAATCATA 3 3 02 

GGAGTTTATT ACAATGTATC TTTATTAAAA 3 3 62 

AAAATAGTGT AAGAACTGTA C ATTGTG AG C 3 422 

AATGTATAAA AATTATCAAA AAGCTAATGT 34 82 

AATGGAGCTC AGTAACATAA CTGCTTCTTG 3 542 

GTTT 3576 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 99 amino acids 

(B) TYPE: amino acid 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ala Ala Pro Ser Pro Ser Gly Gly Gly Gly Ser Gly Gly Gly Ser 
10 1 5 10 ' 15 

Gly Ser Gly Thr Pro Gly Pro Val Gly Ser Pro Ala Pro Gly His Pro 

20 25 30 

Ala Val Ser Ser Met Gin Gly Lys Arg Lys Ala Leu Lys Leu Asn Phe 
35 40 45 

15 Ala Asn Pro Pro Phe Lys Ser Thr Ala Arg Phe Thr Leu Asn Pro Asn 

50 55 60 

Pro Thr Gly Val Gin Asn Pro His lie Glu Arg Leu Arg Thr His Ser 
65 70 75 80 

lie Glu Ser Ser Gly Lys Leu Lys lie Ser Pro Glu Gin His Trp Asp 
20 85 90 95 

Phe Thr Ala Glu Asp Leu Lys Asp Leu Gly Glu lie Gly Arg Gly Ala 

100 105 " 110 

Tyr Gly Ser Val Asn Lys Met Val His Lys Pro Ser Gly Gin lie Met 
115 120 125 

25 Ala Val Lys Arg lie Arg Ser Thr Val Asp Glu Lys Glu Gin Lys Gin 

130 135 140 

Leu Leu Met Asp Leu Asp Val Val Met Arg Ser Ser Asp Cys Pro Tyr 
145 150 155 ~ 1 160 

lie Val Gin Phe Tyr Gly Ala Leu Phe Arg Glu Gly Asp Cys Trp lie 
30 165 170 175 

Cys Met Glu Leu Met Ser Thr Ser Phe Asp Lys Phe Tyr Lys Tyr Val 

180 185 ~ 190 

Tyr Ser Val Leu Asp Asp Val lie Pro Glu Glu lie Leu Gly Lys lie 
195 200 205 

3 5 Thr Leu Ala Thr Val Lys Ala Leu Asn His Leu Lys Glu Asn Leu Lys 

210 215 220 

lie lie His Arg Asp lie Lys Pro Ser Asn lie Leu Leu Asp Arg Ser 
22 5 230 235 " 240 

Gly Asn lie Lys Leu Cys Asp Phe Gly lie Ser Gly Gin Leu Val Asp 
40 245 250 255 

Ser lie Ala Lys Thr Arg Asp Ala Gly Cys Arg Pro Tyr Met Ala Pro 

260 265 ~ 270 

Glu Arg lie Asp Pro Ser Ala Ser Arg Gin Gly Tyr Asp Val Arg Ser 
275 280 285 

4 5 Asp Val Trp Ser Leu Gly lie Thr Leu Tyr Glu Leu Ala Thr Gly Arg 

290 295 300 

Phe Pro Tyr Pro Lys Trp Asn Ser Val Phe Asp Gin Leu Thr Gin Val 
3 °5 310 315 320 

Val Lys Gly Asp Pro Pro Gin Leu Ser Asn Ser Glu Glu Arg Glu Phe 
50 325 330 335 

Ser Pro Ser Phe lie Asn Phe Val Asn Leu Cys Leu Thr Lys Asp Glu 

340 345 350 

Ser Lys Arg Pro Lys Tyr Lys Glu Leu Leu Lys His Pro Phe lie Leu 
355 360 365 

55 Met Tyr Glu Glu Arg Ala Val Glu Val Ala Cys Tyr Val Cys Lys lie 

370 375 380 

Leu Asp Gin Met Pro Ala Thr Pro Ser Ser Pro Met Tyr Val Asp 
385 390 395 
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(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 93 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



Met 


Pro 


Lys 


Lys 


Lys 


Pro 


Thr 


ir X. (_) 


Tip 

lie 


uin 


Leu 


Asn 


Pro 


Ala 


Pro 


Asp 


1 








5 




















15 


Gly 


Ser 


Ala 


Val 


Asn Gly Thr 


Qov 


o6r 


i\± 3. 


Glu 


Thr 


Asn 


Leu 


Glu 


-TV *1 

Ala 








20 










25 










30 






Leu. 


m n 


Lys 


Lys 


Leu 


Glu 


Glu 


Leu 


Glu 


Leu 


Asp Glu Gin Gin Arq 


Lys 






35 










40 










45 








Leu 


Glu 


Ala 


Phe 


Leu 


Thr 


Gin 


Lys 


o±n 


Lys 


Val 


Gly 


Glu 


Leu 


Lys 




50 










55 










60 








Asp 


Asp 


Asp 


Phe 


Glu 


Lys 


He 


Ser 


Glu 


Leu 


Gly Ala 


Gly 


Asn 


Gly 


Gly 


65 










70 










75 






o u 


Val 


Val 


Phe 


Lys 


Val 


Ser 


His 


Lys 


Pro 


C Cx V* 


Gly 


Leu 


Val 


Met 


Ala 


Arg 










85 










on 










95 


Lys 


Leu 


He 


His 


Leu 


Glu 


He 


Lys 


Pro 


7\ "1 -j 

J\±3l 


He 


Arg 


Asn 


Gin 


He 


He 








100 










105 








110 






Arg 


Glu 


Leu 


Gin 


Val 


Leu 


His 


Glu 


Cys 


Asn 


Ser 


Pro 


Tyr 


He 


Val 


Gly 






115 










120 










125 






Phe 


Tyr 


Gly Ala 


Phe 


Tyr 


Ser 


Asp 


Gly 


VjjIU 


He 


Ser 


He 


Cys 


Met 


Glu 




130 










135 










140 








His 


Met 


Asp 


Gly 


Gly 


Ser 


Leu 


Asp 


Gin 


vdl 


Leu 


Lys 


Lys 


Ala 


Gly 


Arg 


145 










150 










155 








160 


He 


Pro 


Glu 


Gin 


He 


Leu 


Gly 


Lys 


Val 


o trX 


He 


Ala 


Val 


He 


Lys 


Gly 










165 










170 










175 


Leu 


Thr 


Tyr 


Leu 


Arg 


Glu 


Lys 


His 


Lys 


Tip 

lie 


Met 


His 


Arg Asp 


Val 


Lys 








180 










185 










190 




Pro 


Ser 


Asn 


He 


Leu 


Val 


Asn 


Ser Arg 


biy 


Glu 


He 


Lys 


Leu 


Cys 


Asp 






195 










200 










205 




Phe 


Gly Val 


Ser 


Gly Gin Leu 


He 


Asp 


o fc: x 


Met 


Ala 


Asn 


Ser 


Phe 


val 




210 










215 










220 










Gly Thr Arg 


Ser 


Tyr 


Met 


Ser 


Pro 


Glu 


Arg 


Leu 


Gin Gly Thr His 


Tyr 


225 










230 










235 










240 


Ser 


Val 


Gin 


Ser 


Asp 


He 


Trp 


Ser 


Met 


Gly 


Leu 


Ser 


Leu 


Val 


Glu 


Met 










245 










250 










255 




Ala 


Val 


Gly Arg 


Tyr 


Pro 


He 


Pro 


Pro 


Pro 


Asp Ala 


Lys 


Glu 


Leu 


Glu 








260 










265 








270 






Leu 


Met 


Phe 


Gly 


Cys 


Gin 


Val 


Glu Gly 


Asp 


Ala 


Ala 


Glu 


Thr 


Pro 


Pro 






275 










280 










285 








Arg 


Pro 


Arg 


Thr 


Pro 


Gly Arg 


Pro 


Leu 


Ser 


Ser 


Tyr 


Gly Met 


Asp 


Ser 




290 










295 










300 








Arg 


Pro 


Pro 


Met 


Ala 


He 


Phe 


Glu 


Leu 


Leu 


Asp 


Tyr 


He 


Val 


Asn 


Glu 


305 










310 










315 








320 


Pro 


Pro 


Pro 


Lys 


Leu 


Pro 


Ser 


Gly Val 


Phe 


Ser 


Leu 


Glu 


Phe 


Gin 


Asp 










325 










330 










335 


Phe 


Val 


Asn 


Lys 


Cys 


Leu 


He 


Lys 


Asn 


Pro 


Ala 


Glu Arg Ala Asp 


Leu 








340 










345 










350 






Lys 


Gin 


Leu 


Met 


Val 


His 


Ala 


Phe 


He 


Lys 


Arg 


Ser 


Asp 


Ala 


Glu 


Glu 






355 










360 










365 








Val 


Asp 


Phe 


Ala 


Gly 


Trp 


Leu 


Cys 


Ser 


Thr 


He 


Gly 


Leu 


Asn 


Gin 


Pro 




370 










375 










380 










Ser 


Thr 


Pro 


Thr 


His 


Ala 


Ala 


Gly Val 
















385 










390 
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(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 00 amino acids 

(B) TYPE: amino acid 
5 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 





Met 


Leu 


Ala 


Arg 


.rtx y 


Lys 


Pro 


Val 


Leu 


XT X U 


Ala 


Leu 


Thr 


x le 


Asn 


Pro 




1 








c 
D 










1U 










15 




10 


Thr 


He 


Ala 


Glu 


Gly 


Prn 

trJL <J 


O CI 


XT X 


X. XXX 


OCX 


Glu 


Gly Ala 


Ser 


Glu 


Ala 










20 










A D 










J 0 








Asn 


Leu 


Val 


Asp 


Leu 


Gin 


Lys 


Lys 


Leu 


Glu 


Glu 


Leu 


Glu 


Leu 


As P 


Glu 








35 




















45 








Gin 


Gin 


Lys 


Lys 


7\ -yrt 

/ii y 


XJG u. 


m n 

VJl Li 




JrXie 


Leu 


Thr 


Gin 


Lys 


Ala 


Lys 


Val 


15 




c; n 

■D V 










DO 










60 








Gly 


UJ.U 


Leu 


Lys 


ASp 


Asp 


nop 


rile 


bill 


Arg 


He 


Ser 


Glu 


Leu 


Gly Ala 




D O 










/ u 










75 










80 




Gly 


Asn 


Gly 


Gly 


V dX 
O 3 


Val 


HPVi 

X Il-L 


Lys 


vol 


bin 

9 U 


His 


Arg 


Pro 


Ser 


Gly 
95 


Leu 


20 


He 


Met 


Ala 


Arg 


Lys 


Leu 


Tip 

x xe 


nlE 


Leu 


o±U 


He 


Lys 


Pro 


Ala 


He 


Arg 










100 










xUb 










110 






Asn 


Gin 


He 
115 


He 


A "vet 
x-ix y 


w JL U. 


Leu 


Gin 
120 


Val 


Leu 


His 


Glu 


Cys 
125 


Asn 


Ser 


Pro 




Tvr 


He 


Val 


Gly 


Phe 


T\ry 


Gly 


Ala 


Phe 


±yr 


Ser 


Asp 


Gly 


Glu 


He 


Ser 


25 




1 J u 










JLO O 










140 










He 


Cys 


Met 


Glu 


His 


Me=>i- 




Gly 


Gly 




Leu 


Asp 


Gin 


Val 


Leu 


Lys 




145 










i cn 

X _> U 










155 










160 




Glu 


Ala 


Lys 


Arg 


He 


Pro 


m n 

ox u. 


Glu 


He 


j-ieu 


Gly Lys 


Val 


Ser 


He 


Ala 












ID j 










1 / U 










175 




30 


Val 


Leu 


Arg Gly 


Leu 


Ala 




Leu 


Arg 


UX IX 


Lys 


His 


Gin 


He 


Met 


His 










180 










185 










190 








Arg 


Asp 


Val 
195 


Lys 


Pro 


Ser 


Asn 


He 
200 


Leu 


Val 


Asn 


Ser 


Arg 
205 


Gly 


Glu 


He 


35 


Lys 


Leu 


Cys 


Asp 


Phe 


Gly 


Val 


Ser 


Gly 


m n 

vJXll 


Leu 


He 


Asp 


Ser 


Met 


Ala 




210 




















220 










Asn 


Ser 


Phe 


Val 


Gly 


Thr 


Arg 


Ser 


Tyr 




Ala 


Pro 


Glu 


Arg 


Leu 


Gin 




225 










23 0 










235 








240 




Gly 


Thr 


His 


Tyr 


Ser 


Val 


Gin 


Ser 


Asp 


He 


Trp 


Ser 


Met 


Gly 


Leu 


Ser 












245 










250 








255 




40 


Leu 


Val 


Glu 


Leu 


Ala 


Val 


Gly 


Arg 


Tyr 


Pro 


He 


Pro 


Pro 


Pro 


Asp 


Ala 










260 










265 










270 






Lys 


Glu 


Leu 


Glu 


Ala 


He 


Phe 


Gly Arg 


Pro 


Val 


Val 


Asp 


Gly 


Glu 


Glu 








275 










280 










285 






45 


Gly 


Glu 


Pro 


His 


Ser 


He 


Ser 


Pro 


Arg 


Pro 


Arg 


Pro 


Pro 


Gly Arg 


Pro 




290 










295 










300 












Val 


Ser 


Gly 


His 


Gly 


Met 


Asp 


Ser 


Arg 


Pro 


Ala 


Met 


Ala 


He 


Phe 


Glu 




305 










310 










315 










320 




Leu 


Leu 


Asp 


Tyr 


He 


Val 


Asn 


Glu 


Pro 


Pro 


Pro 


Lys 


Leu 


Pro 


Asn Gly 












325 










330 










335 




50 


Val 


Phe 


Thr 


Pro 


Asp 


Phe 


Gin 


Glu 


Phe 


Val 


Asn 


Lys 


Cys 


Leu 


He 


Lys 










340 










345 










350 






Asn 


Pro 


Ala 
355 


Glu 


Arg 


Ala 


Asp 


Leu 
360 


Lys 


Met 


Leu 


Thr 


Asn 
365 


His 


Thr 


Phe 




He 


Lys 


Arg 


Ser 


Glu 


Val 


Glu 


Glu 


Val 


Asp 


Phe 


Ala 


Gly 


Trp 


Leu 


Cys 


55 




370 










375 










380 






Lys 


Thr 


Leu 


Arg 


Leu 


Asn 


Gin 


Pro 


Gly 


Thr 


Pro 


Thr 


Arg 


Thr 


Ala 


Val 




385 










390 










395 








400 



(2) INFORMATION FOR SEQ ID NO: 13 

(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 3 67 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

5 Gly Thr Thr Pro Arg Thr Gly Asn Ser Asn Asn Ser Asn Ser Gly Ser 

15 10 15 

Ser Gly Gly Gly Gly Leu Phe Ala Asn Phe Ser Lys Tyr Val Asp lie 

20 25 ' ' 3 0 

Lys Ser Gly Ser Leu Asn Phe Ala Gly Lys Leu Ser Leu Ser Ser Lys 
10 35 40 45 

Gly lie Asp Phe Ser Asn Gly Ser Ser Ser Arg lie Thr Leu Asp Glu 

50 55 60 

Leu Glu Phe Leu Asp Glu Leu Gly His Gly Asn Tyr Gly Asn Val Ser 
65 70 75 ^ 80 

15 Lys Val Leu His Lys Pro Thr Asn Val lie Met Ala Thr Lys Glu Val 

85 90 ' 95 

Arg Leu Glu Leu Asp Glu Ala Lys Phe Arg Gin lie Leu Met Glu Leu 

100 105 110 

Glu Val Leu His Lys Cys Asn Ser Pro Tyr lie Val Asp Phe Tyr Gly 
20 115 120 ' 125 

Ala Phe Phe He Glu Gly Ala Val Tyr Met Cys Met Glu Tyr Met Asp 

130 135 140 

Gly Gly Ser Leu Asp Lys He Tyr Asp Glu Ser Ser Glu He Gly Gly 
145 150 155 160 

25 He Asp Glu Pro Gin Leu Ala Phe He Ala Asn Ala Val He His Gly 

165 170 175 

Leu Lys Glu Leu Lys Glu Gin His Asn He He His Arg Asp Val Lys 

180 185 190 

Pro Thr Asn He Leu Cys Ser Ala Asn Gin Gly Thr Val Lys Leu Cys 
30 ^ 195 200 ~ 205 

Asp Phe Gly Val Ser Gly Asn Leu Val Ala Ser Leu Ala Lys Thr Met 

210 215 220 

Asn He Gly Cys Gin Ser Tyr Met Ala Pro Glu Arg He Lys Ser Leu 
225 230 235 " 240 

35 Asn Pro Asp Arg Ala Thr Tyr Thr Val Gin Ser Asp He Trp Ser Leu 

245 250 " 255 

Gly Leu Ser He Leu Glu Met Ala Leu Gly Arg Tyr Pro Tyr Pro Pro 

260 265 ~ " 270 

Glu Thr Tyr Asp Asn He Phe Ser Gin Leu Ser Ala He Val Asp Gly 
40 ^ 275 280 285 

Pro Pro Pro Arg Leu Pro Ser Asp Lys Phe Ser Ser Asp Ala Gin Asp 

290 295 300 

Phe Val Ser Leu Cys Leu Gin Lys He Pro Glu Arg Arg Pro Thr Tyr 
305 310 315 ~ 320 

45 Ala Ala Leu Thr Glu His Pro Trp Leu Val Lys Tyr Arg Asn Gin Asp 

325 330 335 

Val His Met Ser Glu Tyr He Thr Glu Arg Leu Glu Arg Arg Asn Lys 

340 345 350 

He Leu Arg Glu Arg Gly Glu Asn Gly Leu Ser Lys Asn Val Pro 
50 355 360 * 365 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 3 base pairs 

(B) TYPE: nucleic acid 
55 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 
TTYTAYGGNG CNTTYTTYAT HGA 2 3 
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(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
ATBCTYTCNG GNGCCATKTA 20 

10 (2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 16 : 

Asp Tyr Lys Asp Asp Asp Asp Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 17: 

2 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1623 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

2 5 (ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 281... 1318 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

3 0 GGAAAGGCAG CCTCCTGTAG GTGAAAATTC TGTTCACTAC CTGGCCACCT GGCCTGACTG 6 0 

ACCTTCACAG CTTGATCATC TTCCTGAAGA GGCATTCAGG ATTCCCTCCA TCCCTACCCC 12 0 

TTCTGGACAA AGTCTTCCAC GTTTCCTTCC TGGGAGTTTC TTCCAGGAAC TGGAGATACC 180 

CAGAGCCCTG CAACTCCCAC TGGCCAACGA TGGGGGCAGC CGCTCACCAT CCTCAGAGAG 24 0 

CTCCCCACAG CACCCTACAC CCCCCACCCG GCCCCGCCAC ATG CTG GGG CTC CCA 2 95 
35 Met Leu Gly Leu Pro 

1 5 

TCA ACC TTG TTC ACA CCG CGC AGT ATG GAG AGC ATC GAG ATT GAC CAG 34 3 
Ser Thr Leu Phe Thr Pro Arg Ser Met Glu Ser lie Glu lie Asp Gin 
10 15 20 

40 AAG CTG CAG GAG ATC ATG AAG CAG ACA GGG TAC CTG ACT ATC GGG GGC 3 91 

Lys Leu Gin Glu lie Met Lys Gin Thr Gly Tyr Leu Thr lie Gly Gly 
25 30 35 

CAG CGT TAT CAG GCA GAA ATC AAT GAC TTG GAG AAC TTG GGT GAG ATG 43 9 

Gin Arg Tyr Gin Ala Glu lie Asn Asp Leu Glu Asn Leu Gly Glu Met 
45 40 45 50 
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GGC AGT GGT ACC 
Gly Ser Gly Thr 
55 



TGT GGT CAG GTG TGG AAG ATG CGG TTC CGG AAG ACA 
Cys Gly Gin Val Trp Lys Met Arg Phe Arg Lys Thr 
60 65 



487 



10 



15 



GGC CAC ATC ATT 
Gly His He He 
70 

GAG AAT AAG CGC 
Glu Asn Lys Arg 



GAC TGC CCT TAC 
Asp Cys Pro Tyr 
105 

GAC GTC TTT ATT 
Asp Val Phe He 
120 

AAG AAA CGA ATG 
Lys Lys Arg Met 
135 



GCT GTT AAG CAA ATG CGG CGC TCT GGG AAC AAG GAA 
Ala Val Lys Gin Met Arg Arg Ser Gly Asn Lys Glu 
75 80 ** 85 

ATT TTG ATG GAC CTG GAT GTA GTA CTC AAG AGC CAT 
He Leu Met Asp Leu Asp Val Val Leu Lys Ser His 
90 95 100 

ATC GTT CAG TGC TTT GGC ACC TTC ATC ACC AAC ACA 
He Val Gin Cys Phe Gly Thr Phe He Thr Asn Thr 
110 115 

GCC ATG GAG CTC ATG GGC ACA TGT GCA GAG AAG CTG 
Ala Met Glu Leu Met Gly Thr Cys Ala Glu Lys Leu 
125 130 

CAG GGC CCC ATT CCA GAG CGA ATC CTG GGC AAC ATG 
Gin Gly Pro He Pro Glu Arg He Leu Gly Asn Met 
140 145 



535 



583 



631 



679 



727 



ACT GTG GCG ATT 
2 0 Thr Val Ala He 

150 

GTC ATC CAT CGC 
Val He His Arg 



25 GGC CAG ATC AAG 

Gly Gin He Lys 
185 



GTG AAA GCA CTG TAC TAT CTG AAG GAG AAG CAT GGC 

Val Lys Ala Leu Tyr Tyr Leu Lys Glu Lys His Gly 

155 160 * ' 165 

GAT GTC AAA CCC TCC AAC ATC CTG CTA GAT GAG CGG 

Asp Val Lys Pro Ser Asn He Leu Leu Asp Glu Arg 

170 175 180 

CTC TGT GAC TTT GGC ATC AGT GGC CGC CTT GTT GAC 

Leu Cys Asp Phe Gly He Ser Gly Arg Leu Val Asp 
190 195 



775 



823 



871 



30 



TCC AAA GCC AAA 
Ser Lys Ala Lys 
200 

GAG CGC ATC GAC 
Glu Arg He Asp 
215 



ACA CGG AGT GCT GGC TGT GCT GCC TAT ATG GCT CCC 

Thr Arg Ser Ala Gly Cys Ala Ala Tyr Met Ala Pro 

205 210 

CCT CCA GAT CCC ACC AAG CCT GAC TAT GAC ATC CGA 

Pro Pro Asp Pro Thr Lys Pro Asp Tyr Asp He Arg 
220 225 



919 



967 



35 



40 



45 



GCT GAT GTG TGG 
Ala Asp Val Trp 
230 

CAG TTC CCC TAT 
Gin Phe Pro Tyr 



GTC CTA CAG GAA 
Val Leu Gin Glu 
265 

GGG GAC TTC CAG 
Gly Asp Phe Gin 
280 



AGC CTG GGC ATC TCA CTG GTG GAG CTG GCA ACA GGA 1015 

Ser Leu Gly He Ser Leu Val Glu Leu Ala Thr Gly 

235 240 245 

AAG AAC TGC AAG ACG GAC TTT GAG GTC CTC ACC AAA 10 63 

Lys Asn Cys Lys Thr Asp Phe Glu Val Leu Thr Lys 

250 255 260 

GAG CCC CCA CTC CTG CCT GGT CAC ATG GGC TTC TCA 1111 

Glu Pro Pro Leu Leu Pro Gly His Met Gly Phe Ser 

270 275 



TCA TTT GTC AAA GAC TGC CTT ACT AAA GAT CAC AGG 
Ser Phe Val Lys Asp Cys Leu Thr Lys Asp His Arg 
2 85 2 90 



1159 



AAG AGA CCA AAG 
Lys Arg Pro Lys 
295 



TAT AAT AAG CTA CTT GAA CAC AGC TTC ATC AAG CAC 
Tyr Asn Lys Leu Leu Glu His Ser Phe He Lys His 
300 305 



1207 



TAT GAG ATA CTC 
50 Tyr Glu He Leu 

310 



GAG GTG GAT GTC GCG TCC TGG TTT AAG GAT GTC ATG 
Glu Val Asp Val Ala Ser Trp Phe Lys Asp Val Met 
315 320 * ~ 325 



1255 
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GCG AAG ACC GAG TCC CCA AGG ACT AGT GGA GTC CTG AGT CAG CAC CAT 13 03 

Ala Lys Thr Glu Ser Pro Arg Thr Ser Gly Val Leu Ser Gin His His 
330 335 340 

CTG CCC TTC TTC AGG TAGCCTCATG GCAGCGGCCA GCCCCGCAGG GGCCCCGGGC C 1359 
5 Leu Pro Phe Phe Arg 

345 

ACGGCCACCG ACCCCCCCCC CAACCTGGCC AACCCAGCTG CCCATCAGGG GACCTGGGAC 1419 

CTGGACGACT GCCAAGGACT GAGGACAGAA AGTAGGGGGT TCCCATCCAG CTCTGACTCC 14 7 9 

CTGCCTACCA GCTGTGGACA AAAGGGCATG CTGGTTCCTA ATCCCTCCCA CTCTGGGGTC 153 9 

10 AGCCAGCAGT GTGAGCCCCA TCCCACCCCG ACAGACACTG TGAACGGAAG ACAGCAGGCC 15 99 

AAAAAAAAAA AAAAAAAAAA AAAA 1623 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 34 6 amino acids 

15 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



20 


Met 


Leu 


Gly 


Leu 


Pro 


Ser 


Thr 


Leu 


Phe 


Thr 


Pro 


Arg 


Ser 


Met 


Glu 


Ser 




1 








5 


















15 






lie 


Glu 


He 


Asp 


Gin 


Lys 


Leu 


Gin 


Glu 


He 


Met 


Lys 


Gin 


Thr Gly Tyr 










20 










25 










30 








Leu 


Thr 


He 


Gly 


Gly 


Gin 


Arg 


Tyr 


Gin 


Ala 


Glu 


He 


Asn 


Asp 


Leu 


Glu 


25 






35 










40 










45 








Asn 


Leu 


Gly 


Glu 


Met 


Gly 


Ser 


Gly Thr 


Cys 


Gly 


Gin 


Val 


Trp 


Lys 


Met 






50 










55 










60 










Arg 


Phe 


Arg 


Lys 


Thr 


Gly 


His 


He 


He 


Ala 


Val 


Lys 


Gin 


Met 


Arg 


Arg 




65 










70 










75 








80 


30 


Ser 


Gly 


Asn 


Lys 


Glu 
85 


Glu 


Asn 


Lys 


Arg 


He 
90 


Leu 


Met 


Asp 


Leu 


Asp 
95 


Val 




Val 


Leu 


Lys 


Ser 


His 


Asp 


Cys 


Pro 


Tyr 


He 


Val 


Gin 


Cys 


Phe 


Gly 


Thr 










100 










105 










110 






Phe 


He 


Thr 


Asn 


Thr 


Asp 


Val 


Phe 


lie 


Ala 


Met 


Glu 


Leu 


Met 


Gly 


Thr 


35 






115 










120 










125 








Cys 


Ala 


Glu 


Lys 


Leu 


Lys 


Lys 


Arg 


Met 


Gin 


Gly 


Pro 


He 


Pro 


Glu 


Arg 






130 










135 










140 










lie 


Leu 


Gly 


Asn 


Met 


Thr 


Val 


Ala 


He 


Val 


Lys 


Ala 


Leu 


Tyr 


Tyr 


Leu 




145 










150 










155 








160 


40 


Lys 


Glu 


Lys 


His 


Gly 
165 


Val 


He 


His 


Arg 


Asp 
170 


Val 


Lys 


Pro 


Ser 


Asn 
175 


He 




Leu 


Leu 


Asp 


Glu 
180 


Arg 


Gly 


Gin 


He 


Lys 
185 


Leu 


Cys 


Asp 


Phe 


Gly 
190 


He 


Ser 


45 


Gly 


Arg 


Leu 


Val 


Asp 


Ser 


Lys 


Ala 


Lys 


Thr 


Arg 


Ser 


Ala 


Gly 


Cys 


Ala 






195 










200 










205 










Ala 


Tyr 


Met 


Ala 


Pro 


Glu 


Arg 


He 


Asp 


Pro 


Pro 


Asp 


Pro 


Thr 


Lys 


Pro 






210 










215 










220 










Asp 


Tyr 


Asp 


He 


Arg 


Ala 


Asp 


Val 


Trp 


Ser 


Leu 


Gly 


He 


Ser 


Leu 


Val 




225 










230 










235 










240 


50 


Glu 


Leu 


Ala 


Thr 


Gly 
245 


Gin 


Phe 


Pro 


Tyr 


Lys 
250 


Asn 


Cys 


Lys 


Thr 


Asp 
255 


Phe 




Glu 


Val 


Leu 


Thr 


Lys 


Val 


Leu 


Gin 


Glu 


Glu 


Pro 


Pro 


Leu 


Leu 


Pro 


Gly 










260 










265 










270 






His 


Met 


Gly 


Phe 


Ser 


Gly 


Asp 


Phe 


Gin 


Ser 


Phe 


Val 


Lys 


Asp 


Cys 


Leu 


55 






275 










280 










285 










Thr 


Lys 
290 


Asp 


His 


Arg 


Lys 


Arg 
295 


Pro 


Lys 


Tyr 


Asn 


Lys 
300 


Leu 


Leu 


Glu 


His 




Ser 


Phe 


He 


Lys 


His 


Tyr 


Glu 


He 


Leu 


Glu 


Val 


Asp 


Val 


Ala 


Ser 


Trp 




305 










310 










315 










320 
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Phe Lys Asp Val Met Ala Lys Thr 
325 

Leu Ser Gin His His Leu Pro Phe 
340 



23 



Glu Ser Pro Arg Thr Ser Gly Val 

330 ~ 335 

Phe Arg 
345 



5 (2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1465 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE : 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 3 . . .1169 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

GC ACG AGC CCT GCT CCT GCC CCG TCC CAG CGA GCA GCC CTG CAA CTC 4 7 

Thr Ser Pro Ala Pro Ala Pro Ser Gin Arg Ala Ala Leu Gin Leu 
15 10 15 

CCA CTG GCC AAC GAT GGG GGC AGC CGC TCA CCA TCC TCA GAG AGC TCC 95 

2 0 Pro Leu Ala Asn Asp Gly Gly Ser Arg Ser Pro Ser Ser Glu Ser Ser 

20 25 30 

CCA CAG CAC CCT ACA CCC CCC ACC CGG CCC CGC CAC ATG CTG GGG CTC 143 
Pro Gin His Pro Thr Pro Pro Thr Arg Pro Arg His Met Leu Gly Leu 
35 40 45 

25 CCA TCA ACC TTG TTC ACA CCG CGC AGT ATG GAG AGC ATC GAG ATT GAC 191 

Pro Ser Thr Leu Phe Thr Pro Arg Ser Met Glu Ser lie Glu lie Asp 
50 55 60 

CAG AAG CTG CAG GAG ATC ATG AAG CAG ACA GGG TAC CTG ACT ATC GGG 23 9 

Gin Lys Leu Gin Glu lie Met Lys Gin Thr Gly Tyr Leu Thr lie Gly 
30 65 70 75 

GGC CAG CGT TAT CAG GCA GAA ATC AAT GAC TTG GAG AAC TTG GGT GAG 2 87 

Gly Gin Arg Tyr Gin Ala Glu lie Asn Asp Leu Glu Asn Leu Gly Glu 
80 85 90 95 

ATG GGC AGT GGT ACC TGT GGT CAG GTG TGG AAG ATG CGG TTC CGG AAG 33 5 

3 5 Met Gly Ser Gly Thr Cys Gly Gin Val Trp Lys Met Arg Phe Arg Lys 

100 105 ~ 110 

ACA GGC CAC ATC ATT GCT GTT AAG CAA ATG CGG CGC TCT GGG AAC AAG 3 83 

Thr Gly His lie lie Ala Val Lys Gin Met Arg Arg Ser Gly Asn Lys 
115 120 125 

4 0 GAA GAG AAT AAG CGC ATT TTG ATG GAC CTG GAT GTA GTA CTC AAG AGC 431 

Glu Glu Asn Lys Arg lie Leu Met Asp Leu Asp Val Val Leu Lys Ser 
130 135 140 

CAT GAC TGC CCT TAC ATC GTT CAG TGC TTT GGC ACC TTC ATC ACC AAC 4 79 

His Asp Cys Pro Tyr lie Val Gin Cys Phe Gly Thr Phe lie Thr Asn 
45 145 150 155 

ACA GAC GTC TTT ATT GCC ATG GAG CTC ATG GGC ACA TGT GCA GAG AAG 52 7 

Thr Asp Val Phe lie Ala Met Glu Leu Met Gly Thr Cys Ala Glu Lys 
160 165 170 " 175 
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CTG AAG AAA CGA ATG CAG GGC CCC ATT CCA GAG CGA ATC CTG GGC AAG 575 
Leu Lys Lys Arg Met Gin Gly Pro He Pro Glu Arg He Leu Gly Lys 
180 185 190 

ATG ACT GTG GCG ATT GTG AAA GCA CTG TAC TAT CTG AAG GAG AAG CAT 623 
5 Met Thr Val Ala He Val Lys Ala Leu Tyr Tyr Leu Lys Glu Lys His 

195 200 ^ 205 

GGC GTC ATC CAT CGC GAT GTC AAA CCC TCC AAC ATC CTG CTA GAT GAG 671 
Gly Val He His Arg Asp Val Lys Pro Ser Asn He Leu Leu Asp Glu 
210 215 220 

10 CGG GGC CAG ATC AAG CTC TGT GAC TTT GGC ATC AGT GGC CGC CTT GTT 719 

Arg Gly Gin He Lys Leu Cys Asp Phe Gly He Ser Gly Arg Leu Val 
225 230 235 

GAC TCC AAA GCC AAA ACA CGG AGT GCT GGC TGT GCT GCC TAT ATG GCT 7 67 

Asp Ser Lys Ala Lys Thr Arg Ser Ala Gly Cys Ala Ala Tyr Met Ala 
15 240 245 250 " 255 

CCC GAG CGC ATC GAC CCT CCA GAT CCC ACC AAG CCT GAC TAT GAC ATC 815 
Pro Glu Arg He Asp Pro Pro Asp Pro Thr Lys Pro Asp Tyr Asp He 
260 265 270 

CGA GCT GAT GTG TGG AGC CTG GGC ATC TCA CTG GTG GAG CTG GCA ACA 8 63 

20 Arg Ala Asp Val Trp Ser Leu Gly He Ser Leu Val Glu Leu Ala Thr 

275 280 285 

GGA CAG TTC CCC TAT AAG AAC TGC AAG ACG GAC TTT GAG GTC CTC ACC 911 
Gly Gin Phe Pro Tyr Lys Asn Cys Lys Thr Asp Phe Glu Val Leu Thr 
290 295 300 

25 AAA GTC CTA CAG GAA GAG CCC CCA CTC CTG CCT GGT CAC ATG GGC TTC 959 

Lys Val Leu Gin Glu Glu Pro Pro Leu Leu Pro Gly His Met Gly Phe 
305 310 315 

TCA GGG GAC TTC CAG TCA TTT GTC AAA GAC TGC CTT ACT AAA GAT CAC 1007 
Ser Gly Asp Phe Gin Ser Phe Val Lys Asp Cys Leu Thr Lys Asp His 
30 320 325 330 " 335 

AGG AAG AGA CCA AAG TAT AAT AAG CTA CTT GAA CAC AGC TTC ATC AAG 1055 
Arg Lys Arg Pro Lys Tyr Asn Lys Leu Leu Glu His Ser Phe He Lys 
340 345 350 

CAC TAT GAG ATA CTC GAG GTG GAT GTC GCG TCC TGG TTT AAG GAT GTC 1103 
35 His Tyr Glu He Leu Glu Val Asp Val Ala Ser Trp Phe Lys Asp Val 

355 360 " 365 

ATG GCG AAG ACC GAG TCC CCA AGG ACT AGT GGA GTC CTG AGT CAG CAC 1151 
Met Ala Lys Thr Glu Ser Pro Arg Thr Ser Gly Val Leu Ser Gin His 
370 375 380 

4 0 CAT CTG CCC TTC TTC AGG TAGCCTCATG GCAGCGGCCA GCCCCGCAGG GGCCCCGG 1207 

His Leu Pro Phe Phe Arg 
385 

GCCACGGCCA CCGACCCCCC CCCCAACCTG GCCAACCCAG CTGCCCATCA GGGGAC CTGG 12 6 7 

GACCTGGACG ACTGCCAAGG ACTGAGGACA GAAAGTAGGG GGTTCCCATC CAGCTCTGAC 132 7 

45 TCCCTGCCTA CCAGCTGTGG ACAAAAGGGC ATGCTGGTTC CTAATCCCTC CCACTCTGGG 13 87 

GTCAGCCAGC AGTGTGAGCC CCATCCCACC CCGACAGACA CTGTGAACGG AAGACAGCAA 1447 

AAAAAAAAAA AAAAAAAA 1465 



(2) INFORMATION FOR SEQ ID NO: 20: 



50 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 89 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ> ID NO:20: 



Thr 


Ser 


Pro 


Ala 


Pro 


Ala 


Pro 


Ser 


Gin 


Arg 


Ala 


Ala 


Leu 


Gin 


Leu 


Pro 


1 








5 










10 










15 




Leu 


Ala 


Asn 


Asp 


Gly Gly 


Ser 


Arg 


Ser 


Pro 


Ser 


Ser 


Glu 


Ser 


Ser 


Pro 








20 










25 










30 






Gin 


His 


Pro 


Thr 


Pro 


Pro 


Thr 


Arg 


Pro 


Arg 


His 


Met 


Leu 


Gly 


Leu 


Pro 






35 










40 










45 






Ser 


Thr 


Leu 


Phe 


Thr 


Pro 


Arcr 


Ser 


Met 


Glu 


Ser 


Tip 
X -i. c 


Glu 


He 


Asp 


Gin 




50 




















b u 








Lvs 


Leu 


Gin 


Glu 


He 


Met 


Lys 


Gin 


Thr 


Gly Tyr 


J-lG U 


Thr 


He 


Gly 


Gly 


65 










70 










75 








80 


Gin 


Arg 


Tyr 


Gin 


Ala 


Glu 


He 


Asn Asp 


Leu 


Glu 


A c; Tl 

"Oil 


Leu 


Gly Glu 












85 










90 










95 




Gly 


Ser Gly Thr 


Cys 


Gly 


Gin 


Val 


Trp 


Lys 


Met 


Arg 


Phe 


Arg 


Lys 


Tiir 








JL U U 










105 










110 




Gly 


Hi S 


lie 


lie 


Ala 


Val 


Lvs 


Gin 


Met 


Arg 


Arg 


Ser 


Gly Asn Lys 


Glu 






lib 










120 










125 








Glu 


Asn 


Lys 


Arg 


He 


Leu 


Met 


Asp 


Leu 


Asp 


Val 


Val 


Leu 


Lys 


Ser 


His 




13 0 










13 5 










J. *± U 








Asp 


Cys 


Pro 


Tyr 


He 


Val 


Gin 


Cys 


Phe 


Gly Thr 


-true 


He 


Thr 


Asn 


Thr 


145 










150 










155 










160 


Asp 


Val 


Phe 


He 


Ala 
165 


Met 


Glu 


Leu 


Met 


Gly 
170 


Thr 


Cys 


Ala 


Glu 


Lys 
175 


Leu 


Lys 


Lys 


Arg 


Met 


Gin Gly 


Pro 


He 


Pro 


Glu 


Arg 


He 


Leu 


Gly Lys 


Met 








180 










185 










190 






Thr 


Val 


Ala 


He 


Val 


Lys 


Ala 


Leu 


Tyr 


Tyr 


Leu 


Lys 


Glu 


Lys 


His 


<X\ \r 
oiy 






195 










200 










205 






Val 


lie 


His 


Arg 


Asp 


Val 


Lys 


Pro 


Ser 


Asn 


He 


Leu 


Leu Asp 


Glu 


Arg 




210 










215 










220 








Gly 


Gin 


He 


Lys 


Leu 


Cys 


Asp 


Phe Gly 


He 


Ser 


Gly 


Arg 


Leu 


Val 


Asp 


225 










230 










235 










24 0 


Ser 


Lys 


Ala 


Lys 


Thr 


Arg 


Cot* 


Ala Gly 


Cys 


Ala 


Ala 


Tyr 


Met 


Ala 


Pro 










245 










250 










255 




Glu 


Arg 


He 


Asp 


Pro 


Pro 


Asp 


Pro 


Thr 


Lys 


Pro 


Asp 


Tyr 


Asp 


He 


Arg 








260 










265 










270 




Ala 


Asp 


Val 


Trp 


Ser 


Leu 


Gly 


He 


Ser 


Leu 


Val 


Glu 


Leu 


Ala 


Thr 


Gly 






275 










280 










285 






Gin 


Phe 


Pro 


Tyr 


Lys 


Asn 


Cys 


Lys 


Thr 


As P 


Phe 


Glu 


Val 


Leu 


Thr 


Lys 




290 










295 










300 








Val 


Leu 


Gin 


Glu 


Glu 


Pro 


Pro 


Leu 


Leu 


Pro 


Gly His 


Met 


Gly 


Phe 


Ser 


305 










310 










315 








320 


Gly 


Asp 


Phe 


Gin 


Ser 


Phe 


Val 


Lys 


Asp 


Cys 


Leu 


Thr 


Lys 


Asp 


His 


Arg 










325 










330 










335 


Lys 


Arg 


Pro 


Lys 


Tyr 


Asn 


Lys 


Leu 


Leu 


Glu 


His 


Ser 


Phe 


He 


Lys 


His 








340 










345 










350 




Tyr 


Glu 


He 


Leu 


Glu 


Val 


Asp 


Val 


Ala 


Ser 


Trp 


Phe 


Lys 


Asp 


Val 


Met 






355 










360 










365 






Ala 


Lys 


Thr 


Glu 


Ser 


Pro 


Arg 


Thr 


Ser 


Gly Val 


Leu 


Ser 


Gin 


His 


His 




370 










375 










380 










Leu 


Pro 


Phe 


Phe 


Arg 
























385 































10 



15 



20 



25 



30 



35 



40 



45 



50 



55 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 93 amino acids 

(B) TYPE: amino acid 
60 (D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE; protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 





Ser 


Ala 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser Ala 


Ser 


Ala 


Phe 


Ala 


Ser 


Ala 


Ala 




1 








5 








10 










15 






Pro 


Ala 


Thr 


Gly 


i nr 


File 




Gly Thr 


Tyr 


Thr 


Pro 


Pro 


Thr 


Thr 


Arg 










20 








<si b 










30 






Vdl 


Ser Arg Ala 


Thr 


Pro 


Thr 


Leu Pro 


Met 


Leu 


Ser 


Ser 


Gly Pro 


Gly 








35 










a n 








45 










Gly Leu Asn 


Arg 


i- I1X 


Arg 


irro Asn 


He 


Leu 


Pro 


Leu 


Pro 


Thr 


Pro 


10 




50 










55 








60 












rl (J 


His 


Pro 


Pro 


Vol 


Ser 




inr Asp 


Met 


Lys 


Leu 


Lys 


He 


He 


Met 




65 










70 








75 








80 




Pin 


Gin Thr Gly 


Lys 


Leu 


Asn 


_ _ 

lie Asn 


Gly Arg Gin 


Tyr 


Pro 


Thr 


Asp 












85 








90 










95 




Tip 
1 lG 


Asn Asp 


Leu 


Lys 


U ' _ 

ill s 


Leu 


Ljiy Asp 


Leu 


Gly Asn 


Gly Thr 


Ser 


Gly 










100 


















110 






Asn 


Val 


Val 
115 


Lys 


fiec 


Me u 


rilS 


Leu Ser 
ion 

X U 


Ser 


Asn 


Thr 


He 
125 


He 


Ala 


Val 




Lys 


Gin 


Met 


Arg 


Arg 


i nr 


wiy 


Asn Ala 


Glu 


Glu 


Asn 


Lys 


Arg 


He 


Leu 


20 




130 










13 5 








140 










Asp 


Leu 


Asp 


Vdl 


Vdl 


Leu 


Lys Ser 


His 


Asp 


Cys 


Lys 


Tyr 


He 


Val 




145 










JL 3 U 








155 








160 




Lys 


Cys 


Leu 


Gly 


Cys 

J. D D 


Fne 


vai 


Arg Asp 


Pro 
170 


Asp 


Val 


Trp 


He 


Cys 
175 


Met 


z o 


kjl U 


Leu 


Met 


Ser 


Meu 


Cys 


Pne 


Asp Lys 


Leu 


Leu 


Lys 


Leu 


Ser 


Lys 


Lys 










180 








loo 










190 




Pro 


Val 


Pro 
195 


Glu 


Cain 


lie 


Leu 


Gly Lys 

A U U 


Val 


Thr 


Val 


Ala 
205 


Thr 


Val 


Asn 




Ala 


Leu 


Ser 


Tyr 


Leu 


Lys 


Asp 


Lys His 


Gly 


Val 


He 


His 


Arg 


Asp 


Val 


30 




210 


















220 








Lys 


Pro 


Ser 


Asn 


Tl Q 

i le 


Leu 


Tl Q 

lie 


Asp CjIU 


Arg 


Gly Asn 


He 


Lys 


Leu 


Cys 




225 










230 








235 








240 




Asp 


Phe 


Gly 


He 


Ser 


Cjiiy 


Arg 


lieu val 


Asp 


Ser 


Lys 


Ala 


Lys 


Thr 


Arg 












OA ^ 

*L '-± ZJ 








250 










255 




ber 


Ala 


Gly 


Cys 


ax a 


Ala 


Tyr 


Met: Ala 


Pro 


Glu Arg 


He 


Asp 


Pro 


Lys 










260 








O £T C 










270 






Lys 


Pro 


Lys 


Tyr 


Asp 


He 


Arg 


Ala Asp 


Val 


Trp 


Ser 


Leu 


Gly 


He 


Thr 








275 










2 80 








285 








Leu. 


Val 


Glu 


Leu 


Ala 


Thr 


Ala 


Arg Ser 


Pro 


Tyr 


Glu 


Gly Cys 


Asn 


Thr 


40 




290 










295 








300 












Asp 


Phe 


Glu 


Val 


Leu 


Thr 


Lys 


Val Leu 


Asp 


Ser 


Glu 


Pro 


Pro 


Cys 


Leu 




305 










310 








315 








320 




Pro 


Tyr Gly Glu 


Gly 


Tyr 


Asn 


Phe Ser 


Gin 


Gin 


Phe 


Arg Asp 


Phe 


Val 












325 








330 










335 




45 


He 


Lys 


Cys 


Leu 
340 


Thr 


Lys 


Asn 


His Gin 
345 


Asp 


Arg 


Pro 


Lys 


Tyr 
350 


Pro 


Glu 




Leu 


Leu 


Ala 


Gin 


Pro 


Phe 


He 


Arg He 


Tyr 


Glu 


Ser 


Ala 


Lys 


Val 


Asp 








355 










360 








365 






Val 


Pro 


Asn 


Gin 


Ser 


He 


Lys 


Asp Asn 


Arg 


Leu 


Arg 


Ala 


Asn 


Gly 


Asp 


50 


Pro 
385 


370 
Thr 


Leu 


Gin 


Arg 


Leu 
390 


375 
Pro 


Asn Ser 






380 







(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 4 05 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 





lie 


Gly Gin Val 


Leu 


Pro 


\jJ_U 


Ala 


Thr 


Thr 


Thr 


Ala 


Phe 


Glu 


Tyr 


Glu 




X 








o 










1 u 










15 






Asp 


Glu Asp 


Gly 


Asp 


Arg 


lie 


Thr 


Val 


Arg 


Ser 


Asp 


Glu 


Glu 


Met 


Lys 


5 








20 










25 










30 








Met 


Leu 
35 


Ser 


Tyr 


Tyr 


Tyr 


Ser 
40 


Thr 


val 


Met 


Glu 


Gin 
45 


Gin 


Val 


Asn 






Gin 


Leu 


lie 


r*~\ it 
(jIU 


Pro 


Leu 


Gin 


He 


Pne 


Pro 


Arg 


Ala 


Cys 


Lys 


Pro 






50 




















D U 








1 0 




Gly Glu Arg 


Asn 


lie 


111 s 


Gly Leu 


Lys 


vai 


Asn 


Thr Arg Ala Gly 




65 










70 




















80 






Ser 


Gin 


His 


Q •v- 


5ci 


Pro 


Ala 


Val 


Ser 


Asp 


Ser 


Leu 


Pro 


Ser 


Asn 












ft ^ 










q n 








95 








Leu 


Lys 


Lys 


OCT 


Co v 

oer 


Aid 


Glu 


Leu 


Lys 


Lys 


Tic 

lie 


Leu Ala Asn Gly 


15 








100 










105 










110 








V3X1X 


Met 


Asn 


Glu 


CXI m 

win 


Asp 


i ie 


Arg 


Tyr 


Arg 


Asp 


Thr 


Leu Gly His 


Gly 








115 










120 










125 








noli 


Gly Gly Thr 


Val 


V3fX u. 


Lys 


Met 


Arg 


XT ' 

Hi s 


Val 


Pro 


Ser 


Gly 


Lys 


He 






13 0 










135 










140 










Leu 


Ala 


val 


Lys 


vdl 


lie 


Leu 


Leu Asp 


Ti- 
ne 


Thr 


Leu 


Glu 


Leu 


Gin 


Lys 




145 










15 0 










155 










160 




m n 


i _Le 


Met 


Ser 


oiu 


Leu 


ulU 


He 


Leu 


Tic, 

ne 


Lys 


Cys 


Asp 


Ser 


Ser 


Tyr 












IDJ 










1 / u 










175 




x is 


T 1 (=, 
lie 


biy 


irlie 


Tyr 


Ljiy 


Ala 


Phe 


Phe 


Val 


Glu 


Asn 


Arg 


He 


Ser 


He 


25 








180 










185 








190 








^— y o 


Thr 


Glu 


Phe 


i v ie u 


Asp 


Lriy 


Gly 


Ser 


Leu 


Asp 


Asp 


lie 


Gly 


Lys 


Met 








195 










200 










205 








Pro 


Glu 


His 


Val 


Leu 


Liiy 


Arg 


He 


Ala 


val 


Ala 


Val 


Val 


Lys 


Gly 


Leu 






210 










215 










220 










TV-it* 

-L IlX 

225 


Tyr 


Lys 


Gly 


Leu 


Thr 
230 


Tyr 


Leu 


Trp 


Ser 


Leu 
235 


Lys 


He 


Leu 


His 


Arg 
240 




Asp 


Val 


Lys 


Pro 


Ser 


Asn 


Met 


Val 


Asn 


Thr 


Arg Gly 


Gin 


Val 


Lys 


Leu 






















*5 tr n 










255 






Cys 


Asp 


Phe 


Gly 




Ser 


Thr 


Gin 


Leu 


val 


Asn 


Ser 


He 


Ala 


Lys 


Thr 


35 








260 










265 










270 






Tyr 


Val 


Gly 


Thr 


Asn 


7\ *1 

/\ia 


Tyr 


Met 


Ala 


Pro 


Glu 


Arg 


He 


Ser 


Gly 


Glu 








275 










280 








285 








VjXIi 


Tyr 
290 


Gly 


lie 


ill S 


Ser 


Asp 


Val 


Trp 


Ser 


Leu 


Gly 
300 


He 


Thr 


Met 


He 




nil 1 1 

305 


Leu 


Ala 


Thr 


r\ *i Tr 

Cj±y 


Arg 
310 


rne 


Pro 


Tyr 


Pro 


Lys 
315 


Trp 


Asn 


Ser 


Val 


Leu 
320 




Gin 


Leu 


Leu 


Gin 


Cys 
325 


He 


Val 


Asp 


Glu 


Asp 
330 


Ser 


Pro 


Val 


Leu 


Pro 
335 


Val 




Gly 


Glu 


Phe 


Ser 


Glu 


Pro 


Phe 


Val 


His 


Phe 


He 


Thr 


Gin 


Cys 


Met 


Arg 


45 








340 










345 










350 






Thr 


Gin 


Pro 


Lys 


Glu 


Arg 


Pro 


Ala 


Pro 


Glu 


Glu 


Leu 


Met 


Gly His 


Pro 








355 










360 










365 










Phe 


lie 


Val 


Gin 


Phe 


Asn 


Asp 


Gly Asn 


Ala 


Ala 


Val 


Val 


Ser 


Met 


Trp 






370 










375 










380 








50 


Val 


Cys 


Arg 


Ala 


Leu 


Glu 


Glu 


Arg Arg 


Thr 


Ser 


Arg 


Gly 


Pro 


Arg 


Glu 




385 










390 










395 








400 




Ala 


Ala 


Ala 


Gly 


His 

























405 

(2) INFORMATION FOR SEQ ID NO:23: 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

60 (ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
ATNGCNGTNA ARCARATG 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

ATNCKYTCNG GNGC CATRTA 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 843 base pairs 

15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

2 0 (A) NAME/KEY: Coding Sequence 

(B) LOCATION: 62 . . . 841 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

TGTTTGTCTG CCGGACTGAC GGGCGGCCGG GCGGTGCGCG GCGGCGGTGG CGGCGGGGAA 60 

G ATG GCG GCG TCC TCC CTG GAA CAG AAG CTG TCC CGC CTG GAA GCA AAG 10 9 

25 Met Ala Ala Ser Ser Leu Glu Gin Lys Leu Ser Arg Leu Glu Ala Lys 

1 5 10 15 

CTG AAG CAG GAG AAC CGG GAG GCC CGG CGG AGG ATC GAC CTC AAC CTG 157 
Leu Lys Gin Glu Asn Arg Glu Ala Arg Arg Arg He Asp Leu Asn Leu 
20 25 " 30 

3 0 GAT ATC AGC CCC CAG CGG CCC AGG CCC ACC CTG CAG CTC CCG CTG GCC 2 05 

Asp He Ser Pro Gin Arg Pro Arg Pro Thr Leu Gin Leu Pro Leu Ala 
35 40 45 

AAC GAT GGG GGC AGC CGC TCG CCA TCC TCA GAG AGC TCC CCG CAG CAC 2 53 

Asn Asp Gly Gly Ser Arg Ser Pro Ser Ser Glu Ser Ser Pro Gin His 
35 50 55 60 

CCC ACG CCC CCC GCC CGG CCC CGC CAC ATG CTG GGG CTC CCG TCA ACC 3 01 

Pro Thr Pro Pro Ala Arg Pro Arg His Met Leu Gly Leu Pro Ser Thr 
65 70 75 80 



40 



CTG TTC ACA CCC CGC AGC ATG GAG AGC ATT GAG ATT GAC CAG AAG CTG 34 9 

Leu Phe Thr Pro Arg Ser Met Glu Ser He Glu He Asp Gin Lys Leu 
85 90 ~ 95 

CAG GAG ATC ATG AAG CAG ACG GGC TAC CTG ACC ATC GGG GGC CAG CGC 3 97 

Gin Glu He Met Lys Gin Thr Gly Tyr Leu Thr He Gly Gly Gin Arg 
100 105 110 

45 TAC CAG GCA GAA ATC AAC GAC CTG GAG AAC TTG GGC GAG ATG GGC AGC 445 

Tyr Gin Ala Glu lie Asn Asp Leu Glu Asn Leu Gly Glu Met Gly Ser 
115 120 125 
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GGC ACC TGC GGC CAG GTG TGG AAG ATG CGC TTC CGG AAG ACC GGC CAC 4 93 

Gly Thr Cys Gly Gin Val Trp Lys Met Arg Phe Arg Lys Thr Gly His 
130 135 140 

GTC ATT GCC GTT AAG CAA ATG CGG CGC TCC GGG AAC AAG GAG GAG AAC 541 
5 Val lie Ala Val Lys Gin Met Arg Arg Ser Gly Asn Lys Glu Glu Asn 

145 150 "* 155 " 160 

AAG CGC ATC CTC ATG GAC CTG GAT GTG GTG CTG AAG AGC CAC GAC TGC 58 9 

Lys Arg lie Leu Met Asp Leu Asp Val Val Leu Lys Ser His Asp Cys 
165 170 175 

10 CCC TAC ATC GTG CAG TGC TTT GGG ACG TTC ATC ACC AAC ACG GAC GTC 63 7 

Pro Tyr lie Val Gin Cys Phe Gly Thr Phe lie Thr Asn Thr Asp Val 
180 185 190 

TTC ATC GCC ATG GAG CTC ATG GGC ACC TGC GCT GAG AAG CTC AAG AAG 685 
Phe lie Ala Met Glu Leu Met Gly Thr Cys Ala Glu Lys Leu Lys Lys 
15 195 200 205 

CGG ATG CAG GGC CCC ATC CCC GAG CGC ATT CTG GGC AAG ATG ACA GTG 73 3 

Arg Met Gin Gly Pro lie Pro Glu Arg lie Leu Gly Lys Met Thr Val 
210 215 220 

GCG ATT GTG AAG GCG CTG TAC TAC CTG AAG GAG AAG CAC GGT GTC ATC 781 

2 0 Ala lie Val Lys Ala Leu Tyr Tyr Leu Lys Glu Lys His Gly Val lie 

225 230 235 ~ 240 

CAC CGC GAC GTC AAG CCC TCC AAC ATC CTG CTG GAC GAG CGG GGC CAG 82 9 

His Arg Asp Val Lys Pro Ser Asn lie Leu Leu Asp Glu Arg Gly Gin 
245 250 255 

25 ATC AAG CTG TGC GA 84 3 

lie Lys Leu Cys 
260 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

3 0 (A) LENGTH: 2 60 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



35 




(xi) J 


SEQUENCE 


DESCRIPTION 


: SEQ ID 


NO: 26 : 












Met 


Ala 


Ala 


Ser 


Ser 


Leu 


Glu 


Gin 


Lys 


Leu 


Ser 


Arg 


Leu 


Glu 


Ala 


Lys 




1 








5 










10 










15 




Leu 


Lys 


Gin 


Glu 


Asn 


Arg 


Glu 


Ala 


Arg 


Arg 


Arg 


He 


Asp 


Leu 


Asn 


Leu 










20 










25 










30 






40 


Asp 


He 


Ser 


Pro 


Gin 


Arg 


Pro 


Arg 


Pro 


Thr 


Leu 


Gin 


Leu 


Pro 


Leu 


Ala 








35 










40 










45 










Asn Asp 


Gly 


Gly 


Ser 


Arg 


Ser 


Pro 


Ser 


Ser 


Glu 


Ser 


Ser 


Pro 


Gin 


His 






50 










55 










60 












Pro 


Thr 


Pro 


Pro 


Ala 


Arg 


Pro 


Arg 


His 


Met 


Leu 


Gly 


Leu 


Pro 


Ser 


Thr 


45 


65 










70 










75 








80 




Leu 


Phe 


Thr 


Pro 


Arg 


Ser 


Met 


Glu 


Ser 


He 


Glu 


He 


Asp 


Gin 


Lys 


Leu 












85 










90 








95 






Gin 


Glu 


He 


Met 


Lys 


Gin 


Thr 


Gly 


Tyr 


Leu 


Thr 


He 


Gly 


Gly 


Gin 


Arg 










100 










105 










110 




50 


Tyr 


Gin 


Ala 


Glu 


He 


Asn 


Asp 


Leu 


Glu 


Asn 


Leu 


Gly Glu Met 


Gly 


Ser 








115 










120 










125 








Gly 


Thr 


Cys 


Gly 


Gin 


Val 


Trp 


Lys 


Met 


Arg 


Phe 


Arg 


Lys 


Thr Gly His 






130 










135 










140 
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V dx 


Tl ~ 
lie 


7\ 1 a 


vai 


Lys 


bin 


ivie u 


Arg 


Arg 


Ser 


Gly Asn 


Lys 


Glu Glu 


Asn 


145 










150 














16 0 


Lys 


Arg 


He 


Leu 


Met 


Asp 


■UC Li 


xt. Id Lv 


Val 


V ct _L 


T .oi i T it r o 

JjcU JjyS 


Ser 


rllS ASp 


Cys 










165 










17 0 






175 


Pro 


Tvr 
iy x 


He 


v ctx 

180 








LjrXy 


inr 
185 


file 


lie mr 


Asn 


Thr Asp 
190 


Val 


Phe 


Tl p 


AT ^ 

nl ct 


1 V 1C L. 


ul Li 


Leu. 






±nr 


Cys 


A±a CjIu 


Lys 


Leu Lys 


Lys 






195 










200 








205 


Arg 


Met 


Gin 


Gly 


Pro 


He 


Pro 


Glu 


Arg 


He 


Leu Gly 


Lys 


Met Thr 


Val 




210 










215 








220 






Ala 


He 


Val 


Lys 


Ala 


Leu 


Tyr 


Tyr 


Leu 


Lys 


Glu Lys 


His 


Gly Val 


He 


225 










230 










235 




240 


His 


Arg 


Asp 


Val 


Lys 


Pro 


Ser 


Asn 


He 


Leu 


Leu Asp 


Glu Arg Gly 


Gin 










245 










250 






255 




He 


Lys 


Leu 


Cys 
260 























(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1643 base pairs 

20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 
25 (A) NAME / KEY : Coding Sequence 

(B) LOCATION: 82... 1338 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

AGCGCAGGCG CAGTGCGGTG TTTGTCTACC CCGGACTGAC GGGTGGCCTG GCGGTGAGCG 60 
3 0 GCGGCAGCGG CGGCGGGGAA G ATG GCG GCG TCC TCC CTG GAG CAG AAG CTG 111 

Met Ala Ala Ser Ser Leu Glu Gin Lys Leu 
1 5 10 

TCC CGC CTG GAA GCC AAG CTG AAG CAG GAG AAC CGT GAG GCC CGC AGG 15 9 

Ser Arg Leu Glu Ala Lys Leu Lys Gin Glu Asn Arg Glu Ala Arg Arg 
35 15 20 25 

AGG ATC GAC CTC AAC TTG GAT ATC AGC CCA CAG CGG CCC AGG CCC ACC 207 
Arg He Asp Leu Asn Leu Asp He Ser Pro Gin Arg Pro Arg Pro Thr 
30 35 40 

CTG CAA CTC CCA CTG GCC AAC GAT GGG GGC AGC CGC TCA CCA TCC TCA 25 5 

40 Leu Gin Leu Pro Leu Ala Asn Asp Gly Gly Ser Arg Ser Pro Ser Ser 

45 50 55 

GAG AGC TCC CCA CAG CAC CCT ACA CCC CCC ACC CGG CCC CGC CAC ATG 3 03 

Glu Ser Ser Pro Gin His Pro Thr Pro Pro Thr Arg Pro Arg His Met 
6 0 65 7 0 

45 CTG GGG CTC CCA TCA ACC TTG TTC ACA CCG CGC AGT ATG GAG AGC ATC 351 

Leu Gly Leu Pro Ser Thr Leu Phe Thr Pro Arg Ser Met Glu Ser He 
75 80 85 90 

GAG ATT GAC CAG AAG CTG CAG GAG ATC ATG AAG CAG ACA GGG TAC CTG 3 99 

Glu He Asp Gin Lys Leu Gin Glu He Met Lys Gin Thr Gly Tyr Leu 
50 95 100 105 

ACT ATC GGG GGC CAG CGT TAT CAG GCA GAA ATC AAT GAC TTG GAG AAC 44 7 

Thr He Gly Gly Gin Arg Tyr Gin Ala Glu He Asn Asp Leu Glu Asn 
110 115 120 
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TTG GGT GAG ATG GGC AGT GGT ACC TGT GGT CAG GTG TGG AAG ATG CGG 4 95 

Leu Gly Glu Met Gly Ser Gly Thr Cys Gly Gin Val Trp Lys Met Arg 
125 130 135 

TTC CGG AAG ACA GGC CAC ATC ATT GCT GTT AAG CAA ATG CGG CGC TCT 54 3 

5 Phe Arg Lys Thr Gly His lie lie Ala Val Lys Gin Met Arg Arg Ser 

140 145 150 

GGG AAC AAG GAA GAG AAT AAG CGC ATT TTG ATG GAC CTG GAT GTA GTA 5 91 

Gly Asn Lys Glu Glu Asn Lys Arg lie Leu Met Asp Leu Asp Val Val 
155 160 165 " 170 

10 CTC AAG AGC CAT GAC TGC CCT TAC ATC GTT CAG TGC TTT GGC ACC TTC 63 9 

Leu Lys Ser His Asp Cys Pro Tyr lie Val Gin Cys Phe Gly Thr Phe 
175 180 185 

ATC ACC AAC ACA GAC GTC TTT ATT GCC ATG GAG CTC ATG GGC ACA TGT 6 87 

lie Thr Asn Thr Asp Val Phe lie Ala Met Glu Leu Met Gly Thr Cys 
15 190 195 200 

GCA GAG AAG CTG AAG AAA CGA ATG CAG GGC CCC ATT CCA GAG CGA ATC 73 5 

Ala Glu Lys Leu Lys Lys Arg Met Gin Gly Pro lie Pro Glu Arg lie 
205 210 215 



20 



CTG GGC AAG ATG ACT GTG GCG ATT GTG AAA GCA CTG TAC TAT CTG AAG 7 83 

Leu Gly Lys Met Thr Val Ala lie Val Lys Ala Leu Tyr Tyr Leu Lys 
220 225 230 

GAG AAG CAT GGC GTC ATC CAT CGC GAT GTC AAA CCC TCC AAC ATC CTG 831 
Glu Lys His Gly Val lie His Arg Asp Val Lys Pro Ser Asn lie Leu 
235 240 245 250 

25 CTA GAT GAG CGG GGC CAG ATC AAG CTC TGT GAC TTT GGC ATC AGT GGC 87 9 

Leu Asp Glu Arg Gly Gin lie Lys Leu Cys Asp Phe Gly lie Ser Gly 
255 260 " 265 

CGC CTT GTT GAC TCC AAA GCC AAA ACA CGG AGT GCT GGC TGT GCT GCC 92 7 

Arg Leu Val Asp Ser Lys Ala Lys Thr Arg Ser Ala Gly Cys Ala Ala 
30 270 275 280 

TAT ATG GCT CCC GAG CGC ATC GAC CCT CCA GAT CCC ACC AAG CCT GAC 975 
Tyr Met Ala Pro Glu Arg lie Asp Pro Pro Asp Pro Thr Lys Pro Asp 
285 290 295 

TAT GAC ATC CGA GCT GAT GTG TGG AGC CTG GGC ATC TCA CTG GTG GAG 1023 

3 5 Tyr Asp lie Arg Ala Asp Val Trp Ser Leu Gly lie Ser Leu Val Glu 

300 305 310 

CTG GCA ACA GGA CAG TTC CCC TAT AAG AAC TGC AAG ACG GAC TTT GAG 1071 
Leu Ala Thr Gly Gin Phe Pro Tyr Lys Asn Cys Lys Thr Asp Phe Glu 
315 320 325 ~ 330 

4 0 GTC CTC ACC AAA GTC CTA CAG GAA GAG CCC CCA CTC CTG CCT GGT CAC 1119 

Val Leu Thr Lys Val Leu Gin Glu Glu Pro Pro Leu Leu Pro Gly His 
335 340 345 

ATG GGC TTC TCA GGG GAC TTC CAG TCA TTT GTC AAA GAC TGC CTT ACT 1167 
Met Gly Phe Ser Gly Asp Phe Gin Ser Phe Val Lys Asp Cys Leu Thr 
45 350 355 360 

AAA GAT CAC AGG AAG AGA CCA AAG TAT AAT AAG CTA CTT GAA CAC AGC 1215 
Lys Asp His Arg Lys Arg Pro Lys Tyr Asn Lys Leu Leu Glu His Ser 
365 370 375 
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TTC ATC AAG CAC TAT GAG ATA CTC GAG GTG GAT GTC GCG TCC TGG TTT 12 63 

Phe lie Lys His Tyr Glu lie Leu Glu Val Asp Val Ala Ser Trp Phe 
380 385 390 

AAG GAT GTC ATG GCG AAG ACC GAG TCC CCA AGG ACT AGT GGA GTC CTG 1311 
5 Lys Asp Val Met Ala Lys Thr Glu Ser Pro Arg Thr Ser Gly Val Leu 

395 400 405 ~ 410 

AGT CAG CAC CAT CTG CCC TTC TTC AGG TAGCCTCATG GCAGCGGCCA GCCCCGC 13 65 
Ser Gin His His Leu Pro Phe Phe Arg 
415 

10 AGGGGCCCCG GGCCACGGCC ACCGACCCCC CCCCCAACCT GGCCAACCCA GCTGCCCATC 142 5 

AGGGGACCTG GGACCTGGAC GACTGCCAAG GACTGAGGAC AGAAAGTAGG GGGTTCCCAT 14 85 

CCAGCTCTGA CTCCCTGCCT ACCAGCTGTG GACAAAAGGG CATGCTGGTT CCTAATCCCT 154 5 

CCCACTCTGG GGTCAGCCAG CAGTGTGAGC CCCATCCCAC CCCGACAGAC ACTGTGAACG 160 5 

GAAGACAGCA GGCCAAAAAA AAAAAAAAAA AAAAAAAA 1643 

15 (2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 419 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

2 0 (ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



25 



30 



35 



40 



45 



50 



55 





(xi) SEQUENCE 


DESCRIPTION 


: SEQ ID 


NO:: 


28 : 










Met 


Ala 


Ala 


Ser 


Ser 


Leu 


Glu 


Gin 


Lys 


Leu 


Ser 


Arg 


Leu 


Glu 


Ala 


Lys 


1 








5 










10 










15 


Leu 


Lys 


Gin 


Glu 


Asn 


Arg 


Glu 


Ala 


Arg 


Arg 


Arg 


He 


Asp 


Leu 


Asn 


Leu 








20 










25 










30 






Asp 


He 


Ser 


Pro 


Gin 


Arg 


Pro 


Arg 


Pro 


Thr 


Leu 


Gin 


Leu 


Pro 


Leu 


Ala 






35 










40 










45 








Asn 


Asp 


Gly 


Gly 


Ser 


Arg 


Ser 


Pro 


Ser 


Ser 


Glu 


Ser 


Ser 


Pro 


Gin 


His 




50 










55 










60 










Pro 


Thr 


Pro 


Pro 


Thr 


Arg 


Pro 


Arg 


His 


Met 


Leu 


Gly 


Leu 


Pro 


Ser 


Thr 


65 










70 










75 








80 


Leu 


Phe 


Thr 


Pro 


Arg 


Ser 


Met 


Glu 


Ser 


He 


Glu 


He 


Asp 


Gin 


Lys 


Leu 










85 










90 










95 




Gin 


Glu 


He 


Met 


Lys 


Gin 


Thr 


Gly 


Tyr 


Leu 


Thr 


He 


Gly 


Gly 


Gin 


Arg 








100 










105 










110 




Tyr 


Gin 


Ala 


Glu 


He 


Asn 


Asp 


Leu 


Glu 


Asn 


Leu 


Gly 


Glu 


Met 


Gly 


Ser 






115 










120 










125 






Gly 


Thr 


Cys 


Gly 


Gin 


Val 


Trp 


Lys 


Met 


Arg 


Phe 


Arg 


Lys 


Thr 


Gly 


His 




130 










135 










140 










He 


He 


Ala 


Val 


Lys 


Gin 


Met 


Arg 


Arg 


Ser 


Gly 


Asn 


Lys 


Glu 


Glu 


Asn 


145 










150 










155 








160 


Lys 


Arg 


He 


Leu 


Met 


Asp 


Leu 


Asp 


Val 


Val 


Leu 


Lys 


Ser 


His 


Asp 


Cys 










165 










170 










175 


Pro 


Tyr 


He 


Val 


Gin 


Cys 


Phe 


Gly 


Thr 


Phe 


He 


Thr 


Asn 


Thr 


Asp 


Val 








180 










185 










190 




Phe 


He 


Ala 


Met 


Glu 


Leu 


Met 


Gly 


Thr 


Cys 


Ala 


Glu 


Lys 


Leu 


Lys 


Lys 






195 










200 










205 




Arg 


Met 


Gin 


Gly 


Pro 


He 


Pro 


Glu 


Arg 


He 


Leu 


Gly Lys 


Met 


Thr 


Val 




210 










215 










220 










Ala 


He 


Val 


Lys 


Ala 


Leu 


Tyr 


Tyr 


Leu 


Lys 


Glu 


Lys 


His 


Gly Val 


He 


225 










230 










235 








240 


His 


Arg 


Asp 


Val 


Lys 


Pro 


Ser 


Asn 


He 


Leu 


Leu 


Asp 


Glu Arg Gly Gin 










245 










250 










255 




He 


Lys 


Leu 


Cys 


Asp 


Phe 


Gly 


He 


Ser 


Gly 


Arg 


Leu 


Val 


Asp 


Ser 


Lys 








260 










265 










270 




Ala 


Lys 


Thr 


Arg 


Ser 


Ala 


Gly 


Cys 


Ala 


Ala 


Tyr 


Met 


Ala 


Pro 


Glu 


Arg 






275 










280 










285 
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lie 


Asp 


PrO 


Pro 


Asp 


Pro 


xnr 


Lys 


Pro 


Asp 


Tyr Asp 


He 


Arg Ala 


Asp 




290 




















300 






vai 


ir P 


O fc: X 


T 

JjGU 




lie 


Ser 


Leu 


vai 


LilU 


Leu 


Ala 


Thr Gly Gin 


Phe 


305 










310 










315 








*2 o n 
J A U 


Pro 


±yi 




A en 


v_ _y & 
325 


Lys 


Th-r 


Asp 


rile 


blU 

330 


vai 


Leu 


Thr 


Lys Val 
335 


Leu 


Gin 


Glu 


Glu 


Pro 


Pro 


Leu 


Leu 


Pro 


Gly 


His 


Met 


Gly 


Phe 


Ser Gly 


Asp 








340 










345 










350 


Phe 


Gin 


Ser 


Phe 


Val 


Lys 


Asp 


Cys 


Leu 


Thr 


Lys 


Asp 


His 


Arg Lys 


Arg 






355 










360 










365 


Pro 


Lys 


Tyr 


Asn 


Lys 


Leu 


Leu 


Glu 


His 


Ser 


Phe 


He 


Lys 


His Tyr 


Glu 




370 










375 










380 




He 


Leu 


Glu 


Val 


Asp 


Val 


Ala 


Ser 


Trp 


Phe 


Lys 


Asp 


Val 


Met Ala 


Lys 


385 










390 










395 








400 


Thr 


Glu 


Ser 


Pro 


Arg 


Thr 


Ser 


Gly 


Val 


Leu 


Ser 


Gin 


His 


His Leu 


Pro 



405 410 415 

Phe Phe Arg 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1578 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

2 5 (ix) FEATURE: 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 281... 1420 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

GGAAAGGCAG CCTCCTGTAG GTGAAAATTC TGTTCACTAC CTGGCCACCT GGCCTGACTG 60 

30 ACCTTCACAG CTTGATCATC TTCCTGAAGA GGCATTCAGG ATTCCCTCCA TCCCTACCCC 12 0 

TTCTGGACAA AGTCTTCCAC GTTTCCTTCC TGGGAGTTTC TTCCAGGAAC TGGAGATACC 18 0 

CAGAGCCCTG CAACTCCCAC TGGCCAACGA TGGGGGCAGC CGCTCACCAT CCTCAGAGAG 240 

CTCCCCACAG CACCCTACAC CCCCCACCCG GCCCCGCCAC ATG CTG GGG CTC CCA 2 95 

Met Leu Gly Leu Pro 

35 15 

TCA ACC TTG TTC ACA CCG CGC AGT ATG GAG AGC ATC GAG ATT GAC CAG 3 43 

Ser Thr Leu Phe Thr Pro Arg Ser Met Glu Ser He Glu He Asp Gin 
10 15 20 

AAG CTG CAG GAG ATC ATG AAG CAG ACA GGG TAC CTG ACT ATC GGG GGC 3 91 

40 Lys Leu Gin Glu He Met Lys Gin Thr Gly Tyr Leu Thr He Gly Gly 

25 30 35 

CAG CGT TAT CAG GCA GAA ATC AAT GAC TTG GAG AAC TTG GGT GAG ATG 43 9 

Gin Arg Tyr Gin Ala Glu He Asn Asp Leu Glu Asn Leu Gly Glu Met 
40 45 50 

4 5 GGC AGT GGT ACC TGT GGT CAG GTG TGG AAG ATG CGG TTC CGG AAG ACA 4 87 

Gly Ser Gly Thr Cys Gly Gin Val Trp Lys Met Arg Phe Arg Lys Thr 
55 60 65 

GGC CAC ATC ATT GCT GTT AAG CAA ATG CGG CGC TCT GGG AAC AAG GAA 535 
Gly His He He Ala Val Lys Gin Met Arg Arg Ser Gly Asn Lys Glu 
50 70 75 80 85 

GAG AAT AAG CGC ATT TTG ATG GAC CTG GAT GTA GTA CTC AAG AGC CAT 5 83 

Glu Asn Lys Arg He Leu Met Asp Leu Asp Val Val Leu Lys Ser His 
90 95 100 
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GAC TGC CCT TAC ATC GTT CAG TGC TTT GGC ACC TTC ATC ACC AAC ACA 631 
Asp Cys Pro Tyr lie Val Gin Cys Phe Gly Thr Phe lie Thr Asn Thr 
105 110 115 

GAC GTC TTT ATT GCC ATG GAG CTC ATG GGC ACA TGT GCA GAG AAG CTG 67 9 

5 Asp Val Phe lie Ala Met Glu Leu Met Gly Thr Cys Ala Glu Lys Leu 

120 125 130 

AAG AAA CGA ATG CAG GGC CCC ATT CCA GAG CGA ATC CTG GGC AAG ATG 727 
Lys Lys Arg Met Gin Gly Pro lie Pro Glu Arg lie Leu Gly Lys Met 
135 140 145 

10 ACT GTG GCG ATT GTG AAA GCA CTG TAC TAT CTG AAG GAG AAG CAT GGC 775 

Thr Val Ala lie Val Lys Ala Leu Tyr Tyr Leu Lys Glu Lys His Gly 
150 155 160 ~ 165 

GTC ATC CAT CGC GAT GTC AAA CCC TCC AAC ATC CTG CTA GAT GAG CGG 82 3 

Val lie His Arg Asp Val Lys Pro Ser Asn lie Leu Leu Asp Glu Arg 
15 170 175 180 

GGC CAG ATC AAG CTC TGT GAC TTT GGC ATC AGT GGC CGC CTT GTT GAC 871 
Gly Gin lie Lys Leu Cys Asp Phe Gly lie Ser Gly Arg Leu Val Asp 
185 190 * 195 

TCC AAA GCC AAA ACA CGG AGT GCT GGC TGT GCT GCC TAT ATG GCT CCC 919 
2 0 Ser Lys Ala Lys Thr Arg Ser Ala Gly Cys Ala Ala Tyr Met Ala Pro 

200 205 210 

GAG CGC ATC GAC CCT CCA GAT CCC ACC AAG CCT GAC TAT GAC ATC CGA 967 
Glu Arg lie Asp Pro Pro Asp Pro Thr Lys Pro Asp Tyr Asp lie Arg 
215 220 225 

25 GCT GAT GTG TGG AGC CTG GGC ATC TCA CTG GTG GAG CTG GCA ACA GGA 1015 

Ala Asp Val Trp Ser Leu Gly lie Ser Leu Val Glu Leu Ala Thr Gly 
230 235 240 245 

CAG TTC CCC TAT AAG AAC TGC AAG ACG GAC TTT GAG GTC CTC ACC AAA 1063 
Gin Phe Pro Tyr Lys Asn Cys Lys Thr Asp Phe Glu Val Leu Thr Lys 
30 250 255 260 

GTC CTA CAG GAA GAG CCC CCA CTC CTG CCT GGT CAC ATG GGC TTC TCA 1111 
Val Leu Gin Glu Glu Pro Pro Leu Leu Pro Gly His Met Gly Phe Ser 
265 270 275 

GGG GAC TTC CAG TCA TTT GTC AAA GAC TGC CTT ACT AAA GAT CAC AGG 1159 
35 Gly Asp Phe Gin Ser Phe Val Lys Asp Cys Leu Thr Lys Asp His Arg 

280 285 290 

AAG AGA CCA AAG TAT AAT AAG CTA CTT GAA CAC AGC TTC ATC ATC AAG 12 07 

Lys Arg Pro Lys Tyr Asn Lys Leu Leu Glu His Ser Phe lie lie Lys 
295 300 305 

4 0 CAC TAT GAG ATA CTC GAG GTG GAT GTC GCG TCC TGG TTT AAG GAT GTC 1255 

His Tyr Glu lie Leu Glu Val Asp Val Ala Ser Trp Phe Lys Asp Val 
310 315 320 " 325 

ATG GCG AAG ACC GAG TCC CCA AGG ACT AGT GGA GTC CTG AGT CAG CAC 13 0 3 

Met Ala Lys Thr Glu Ser Pro Arg Thr Ser Gly Val Leu Ser Gin His 
45 330 ~~ 335 '* 340 

CAT CTG CCC TTC TTC AGT GGG AGT CTG GAG GAG TCT CCC ACT TCC CCA 1351 
His Leu Pro Phe Phe Ser Gly Ser Leu Glu Glu Ser Pro Thr Ser Pro 
345 350 355 

CCT TCT CCC AAG TCC TTC CCT CTG TCA CCA GCC ATC CCT CAG GCC CAG 13 99 

50 Pro Ser Pro Lys Ser Phe Pro Leu Ser Pro Ala lie Pro Gin Ala Gin 

360 365 370 
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GCA GAG TGG GTC TCG GGC AGG TAGGGACCTG GAGTGGCCTG GTCCCACCCT CTGA 1454 
Ala Glu Trp Val Ser Gly Arg 
375 380 

CCTCCTCCTC AGGCCACCAG TGTTGCCCTC TTCCCTTTTT AAAACAAAAT ACCCTTGTTT 1514 

5 GTAAATCCTT AGACGCTTGA GAATAAAACC CTTCCCTTTT CTTCCGAAAA AAAAAAAAAA 15 74 

AAAA 1578 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 80 amino acids 

10 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



1 R 

J- Zj 


1 


Leu 


\j±y 


Leu 


Pro 

c: 
3 


Ser 


Thr 


Leu 


Phe 


Thr 

± U 


Pro 


Arg 


Ser 


Met 


Glu 
15 


Ser 




Tip 


ulU 


J. ± e 


Asp 


<jj_n 


Lys 


Leu 


Gin 


Glu 


lie 


Met 


Lys 


Gin 


Thr 


Gly 


Tyr 










2 0 










25 










30 




Leu 


TVi -v 

i nr 


i._Le 


\j-Ly 




{j±Ll 


Arg 


Tyr 


Gin 


7\ "I _ 

Ala 


Glu 


lie 


Asn 


Asp 


Leu 


Glu 


20 






35 










40 










45 








Asn 


Leu 


or±y 


vjj-LU 


lYieu 


Lily 


Ser 


Gly 


Thr 


Cys 


Gly 


Gin 


Val 


Trp 


Lys 


Met 






5 0 










D O 




















Arg 




Arg 


Lys 


Tnr 


{j±y 


XT-5 t-i 

xilS 


He 


He 


Ala 


Val 


Lys 


Gin 


Met 


Arg 


Arg 




D ~D 










/ u 










/ 3 








80 




OCX 


uiy 


Asn 


Lys 


r*i n 

ulU 


ulU 


Asn 


Lys 


Arg 


lie 


Leu 




Asp 


Leu Asp 


Val 












85 










90 










95 






Val 


Leu 


Lys 


Ser 


His 


Asp 


Cys 


Pro 


Tyr 


He 


Val 


Gin 


Cys 


Phe 


Gly Thr 










100 










105 










110 








Phe 


He 


Thr 


Asn 


Thr 


Asp 


Val 


Phe 


He 


Ala 


Met 


Glu 


Leu 


Met 


Gly Thr 


30 






115 










120 










125 










Cys 


Ala 


Glu 


Lys 


Leu 


Lys 


Lys 


Arg Met 


Gin 


Gly 


Pro 


He 


Pro 


Glu Arg 






130 










135 










140 












He 


Leu 


Gly 


Lys 


Met 


Thr 


Val 


Ala 


He 


Val 


Lys 


Ala 


Leu 


Tyr Tyr 


Leu 




145 










150 










155 










160 


35 


Lys 


Glu 


Lys 


His 


Gly 
165 


Val 


He 


His 


Arg 


Asp 
170 


Val 


Lys 


Pro 


Ser 


Asn 
175 


He 




Leu 


Leu 


Asp 


Glu 
180 


Arg 


Gly 


Gin 


He 


Lys 
185 


Leu 


Cys 


Asp 


Phe 


Gly 
190 


He 


Ser 




Gly 


Arg 


Leu 


Val 


Asp 


Ser 


Lys 


Ala 


Lys 


Thr 


Arg 


Ser 


Ala 


Gly Cys 


Ala 


40 






195 










200 










205 










Ala 


Tyr 


Met 


Ala 


Pro 


Glu 


Arg 


He 


Asp 


Pro 


Pro 


Asp 


Pro 


Thr 


Lys 


Pro 






210 










215 










220 










Asp 


Tyr 


Asp 


He 


Arg 


Ala 


Asp 


Val 


Trp 


Ser 


Leu 


Gly 


He 


Ser 


Leu 


Val 




225 










230 










235 








240 


45 


Glu 


Leu 


Ala 


Thr 


Gly 


Gin 


Phe 


Pro 


Tyr 


Lys 


Asn 


Cys 


Lys 


Thr Asp 


Phe 












245 










250 








255 






Glu 


Val 


Leu 


Thr 


Lys 


Val 


Leu 


Gin 


Glu 


Glu 


Pro 


Pro 


Leu 


Leu 


Pro 


Gly 










260 










265 










270 




50 


His 


Met 


Gly 


Phe 


Ser 


Gly 


Asp 


Phe 


Gin 


Ser 


Phe 


Val 


Lys 


Asp 


Cys 


Leu 






275 










280 










285 






Thr 


Lys 
290 


Asp 


His 


Arg 


Lys 


Arg 
295 


Pro 


Lys 


Tyr 


Asn 


Lys 
300 


Leu 


Leu 


Glu 


His 




Ser 


Phe 


He 


He 


Lys 


His 


Tyr 


Glu 


He 


Leu 


Glu 


Val 


Asp 


Val 


Ala 


Ser 




305 










310 










315 








320 


55 


Trp 


Phe 


Lys 


Asp 


Val 


Met 


Ala 


Lys 


Thr 


Glu 


Ser 


Pro 


Arg 


Thr 


Ser 


Gly 












325 










330 








335 




Val 


Leu 


Ser 


Gin 


His 


His 


Leu 


Pro 


Phe 


Phe 


Ser 


Gly 


Ser 


Leu 


Glu 


Glu 



340 345 350 
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Ser Pro Thr Ser Pro Pro Ser Pro Lys Ser Phe Pro Leu Ser Pro Ala 

355 360 * 365 

lie Pro Gin Ala Gin Ala Glu Trp Val Ser Gly Arg 
370 375 380 

5 (2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1598 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 82 . . . 1440 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

AGCGCAGGCG CAGTGCGGTG TTTGTCTACC CCGGACTGAC GGGTGGCCTG GCGGTGAGCG 6 0 

GCGGCAGCGG CGGCGGGGAA G ATG GCG GCG TCC TCC CTG GAG CAG AAG CTG 111 

Met Ala Ala Ser Ser Leu Glu Gin Lys Leu 
1 5 ' 10 

2 0 TCC CGC CTG GAA GCC AAG CTG AAG CAG GAG AAC CGT GAG GCC CGC AGG 15 9 

Ser Arg Leu Glu Ala Lys Leu Lys Gin Glu Asn Arg Glu Ala Arg Arg 
15 20 25 

AGG ATC GAC CTC AAC TTG GAT ATC AGC CCA CAG CGG CCC AGG CCC ACC 207 
Arg lie Asp Leu Asn Leu Asp lie Ser Pro Gin Arg Pro Arg Pro Thr 
25 30 35 ~ 40 

CTG CAA CTC CCA CTG GCC AAC GAT GGG GGC AGC CGC TCA CCA TCC TCA 255 
Leu Gin Leu Pro Leu Ala Asn Asp Gly Gly Ser Arg Ser Pro Ser Ser 
45 50 55 

GAG AGC TCC CCA CAG CAC CCT ACA CCC CCC ACC CGG CCC CGC CAC ATG 3 03 

3 0 Glu Ser Ser Pro Gin His Pro Thr Pro Pro Thr Arg Pro Arg His Met 

60 65 70 

CTG GGG CTC CCA TCA ACC TTG TTC ACA CCG CGC AGT ATG GAG AGC ATC 3 51 

Leu Gly Leu Pro Ser Thr Leu Phe Thr Pro Arg Ser Met Glu Ser lie 
75 80 85 90 

3 5 GAG ATT GAC CAG AAG CTG CAG GAG ATC ATG AAG CAG ACA GGG TAC CTG 3 99 

Glu lie Asp Gin Lys Leu Gin Glu lie Met Lys Gin Thr Gly Tyr Leu 
95 100 " 105 

ACT ATC GGG GGC CAG CGT TAT CAG GCA GAA ATC AAT GAC TTG GAG AAC 44 7 

Thr lie Gly Gly Gin Arg Tyr Gin Ala Glu lie Asn Asp Leu Glu Asn 
40 110 115 120 

TTG GGT GAG ATG GGC AGT GGT ACC TGT GGT CAG GTG TGG AAG ATG CGG 4 95 

Leu Gly Glu Met Gly Ser Gly Thr Cys Gly Gin Val Trp Lys Met Arg 
125 130 135 

TTC CGG AAG ACA GGC CAC ATC ATT GCT GTT AAG CAA ATG CGG CGC TCT 54 3 

4 5 Phe Arg Lys Thr Gly His lie lie Ala Val Lys Gin Met Arg Arg Ser 

140 145 150 

GGG AAC AAG GAA GAG AAT AAG CGC ATT TTG ATG GAC CTG GAT GTA GTA 5 91 

Gly Asn Lys Glu Glu Asn Lys Arg lie Leu Met Asp Leu Asp Val Val 
155 160 " 165 " " 170 
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CTC AAG AGC CAT GAC TGC CCT TAC ATC GTT CAG TGC TTT GGC ACC TTC 639 

Leu Lys Ser His Asp Cys Pro Tyr lie Val Gin Cys Phe Gly Thr Phe 
175 180 185 

ATC ACC AAC ACA GAC GTC TTT ATT GCC ATG GAG CTC ATG GGC ACA TGT 6 87 

5 lie Thr Asn Thr Asp Val Phe lie Ala Met Glu Leu Met Gly Thr Cys 

190 195 200 

GCA GAG AAG CTG AAG AAA CGA ATG CAG GGC CCC ATT CCA GAG CGA ATC 735 
Ala Glu Lys Leu Lys Lys Arg Met Gin Gly Pro lie Pro Glu Arg lie 
205 210 215 

10 CTG GGC AAG ATG ACT GTG GCG ATT GTG AAA GCA CTG TAC TAT CTG AAG 7 83 

Leu Gly Lys Met Thr Val Ala lie Val Lys Ala Leu Tyr Tyr Leu Lys 
220 225 * 230 

GAG AAG CAT GGC GTC ATC CAT CGC GAT GTC AAA CCC TCC AAC ATC CTG 831 
Glu Lys His Gly Val lie His Arg Asp Val Lys Pro Ser Asn lie Leu 
15 235 240 ~ 245 250 

CTA GAT GAG CGG GGC CAG ATC AAG CTC TGT GAC TTT GGC ATC AGT GGC 87 9 

Leu Asp Glu Arg Gly Gin lie Lys Leu Cys Asp Phe Gly lie Ser Gly 
255 260 ' 265 

CGC CTT GTT GAC TCC AAA GCC AAA ACA CGG AGT GCT GGC TGT GCT GCC 92 7 

2 0 Arg Leu Val Asp Ser Lys Ala Lys Thr Arg Ser Ala Gly Cys Ala Ala 

270 275 280 

TAT ATG GCT CCC GAG CGC ATC GAC CCT CCA GAT CCC ACC AAG CCT GAC 975 
Tyr Met Ala Pro Glu Arg lie Asp Pro Pro Asp Pro Thr Lys Pro Asp 
285 290 295 

25 TAT GAC ATC CGA GCT GAT GTG TGG AGC CTG GGC ATC TCA CTG GTG GAG 1023 

Tyr Asp lie Arg Ala Asp Val Trp Ser Leu Gly lie Ser Leu Val Glu 
300 305 310 

CTG GCA ACA GGA CAG TTC CCC TAT AAG AAC TGC AAG ACG GAC TTT GAG 1071 
Leu Ala Thr Gly Gin Phe Pro Tyr Lys Asn Cys Lys Thr Asp Phe Glu 
30 315 320 325 330 

GTC CTC ACC AAA GTC CTA CAG GAA GAG CCC CCA CTC CTG CCT GGT CAC 1119 
Val Leu Thr Lys Val Leu Gin Glu Glu Pro Pro Leu Leu Pro Gly His 
335 340 345 

ATG GGC TTC TCA GGG GAC TTC CAG TCA TTT GTC AAA GAC TGC CTT ACT 1167 
35 Met Gly Phe Ser Gly Asp Phe Gin Ser Phe Val Lys Asp Cys Leu Thr 

350 355 360 

AAA GAT CAC AGG AAG AGA CCA AAG TAT AAT AAG CTA CTT GAA CAC AGC 1215 
Lys Asp His Arg Lys Arg Pro Lys Tyr Asn Lys Leu Leu Glu His Ser 
365 370 375 

4 0 TTC ATC ATC AAG CAC TAT GAG ATA CTC GAG GTG GAT GTC GCG TCC TGG 1263 

Phe lie lie Lys His Tyr Glu lie Leu Glu Val Asp Val Ala Ser Trp 
380 385 390 

TTT AAG GAT GTC ATG GCG AAG ACC GAG TCC CCA AGG ACT AGT GGA GTC 1311 
Phe Lys Asp Val Met Ala Lys Thr Glu Ser Pro Arg Thr Ser Gly Val 
45 395 400 ~ 405 " 410 

CTG AGT CAG CAC CAT CTG CCC TTC TTC AGT GGG AGT CTG GAG GAG TCT 1359 
Leu Ser Gin His His Leu Pro Phe Phe Ser Gly Ser Leu Glu Glu Ser 
415 420 425 

CCC ACT TCC CCA CCT TCT CCC AAG TCC TTC CCT CTG TCA CCA GCC ATC 140 7 

50 Pro Thr Ser Pro Pro Ser Pro Lys Ser Phe Pro Leu Ser Pro Ala lie 

430 " 435 440 
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CCT CAG GCC CAG GCA GAG TGG GTC TCG GGC AGG TAGGGACCTG GAGTGGCCTG 14 60 
Pro Gin Ala Gin Ala Glu Trp Val Ser Gly Arg 
445 450 

GTCCCACCCT CTGACCTCCT CCTCAGGCCA CCAGTGTTGC CCTCTTCCCT TTTTAAAACA 152 0 

AAATACCCTT GTTTGTAAAT CCTTAGACGC TTGAGAATAA AACCCTTCCC TTTTCTTCCG 158 0 

AAAAAAAAAA AAAAAAAA 1598 



(2) INFORMATION FOR SEQ ID NO: 32: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 453 amino acids 

10 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 



JLD 


TUT 4— 

Met 
1 


A_La 


A_i_a 


Ser 


Ser 
5 


Leu 


Glu 




Leu 


Lys 




blU 
A u 


Asn 


Arg 


Glu 




Asp 


Tl c 

.L_Le 


Ser 


Pro 


oin 


Arg 


Pro 


20 






35 












Asn 


Asp 

3 U 


(jly 


Gly 


Ser 


Arg 


Ser 
55 




Pro 


Thr 


Pro 


Pro 


Thr 


Arg 


Pro 




65 










70 






Leu 




inr 


Pro 


Arg 
ob 


Ser 


Met 




Gin 


Glu 


He 


Met 


Lys 


Gin 


Thr 










100 








Tyr 


Gin 


Ala 


Glu 


He 


Asn 


Asp 


30 






115 










Gly 


Thr 
130 


Cys 


Gly 


Gin 


Val 


Trp 
135 




He 


He 


Ala 


Val 


Lys 


Gin 


Met 




145 








150 




35 


Lys 


Arg 


He 


Leu 


Met 
165 


Asp 


Leu 




Pro 


Tyr 


He 


Val 
180 


Gin 


Cys 


Phe 




Phe 


He 


Ala 


Met 


Glu 


Leu 


Met 


40 






195 












Arg 


Met 
210 


Gin 


Gly 


Pro 


He 


Pro 
215 




Ala 


He 


Val 


Lys 


Ala 


Leu 


Tyr 




225 










230 




45 


His 


Arg 


Asp 


Val 


Lys 
245 


Pro 


Ser 




He 


Lys 


Leu 


Cys 
260 


Asp 


Phe 


Gly 




Ala 


Lys 


Thr 


Arg 


Ser 


Ala 


Gly 


50 






275 












He 


Asp 
290 


Pro 


Pro 


Asp 


Pro 


Thr 
295 




Val 


Trp 


Ser 


Leu 


Gly 


He 


Ser 




305 










310 




55 


Pro 


Tyr 


Lys 


Asn 


Cys 
325 


Lys 


Thr 




Gin 


Glu 


Glu 


Pro 


Pro 


Leu 


Leu 



340 





Lys 


Leu 


Ser 


Arg 


Leu 


Glu 


Ala 


Lys 






T Pi 
J. U 










15 


Til - 


Arg 


Arg 


Arg 


He 


Asp 


Leu 


Asn 


Leu 














30 






Arg 


Pro 


Thr 


Leu 


Gin 


Leu 


Pro 


Leu 


Ala 


a n 










45 








Pro 


Ser 


Ser 


Glu 


Ser 
60 


Ser 


Pro 


Gin 


His 


Arg 


rll S 


Met 


Leu 


Gly 


Leu 


Pro 


Ser 


Thr 








•"7 n 










80 


KJ J-U 


Ser 


lie 


(jJ_U 


He 


Asp 


Gin 


Lys 


Leu 






90 










95 




Gly 


Tyr 


Leu 


Thr 


He 


Gly Gly 


Gin 


Arg 




105 










110 




Leu 


Glu 


Asn 


Leu 


Gly Glu Met 


Gly 


Ser 


120 










125 






Lys 


Met 


Arg 


Phe 


Arg 


Lys 


Thr 


Gly His 










140 










Arg 


Arg 


Ser 


Gly 


Asn 


Lys 


Glu 


Glu 


Asn 








155 










160 


Asp 


Val 


Val 


Leu 


Lys 


Ser 


His 


Asp 


Cys 






170 










175 




Gly 


Thr 


Phe 


He 


Thr 


Asn 


Thr 


Asp 


Val 




185 










190 




Gly 


Thr 


Cys 


Ala 


Glu 


Lys 


Leu 


Lys 


Lys 


200 










205 






Glu 


Arg 


He 


Leu 


Gly 


Lys 


Met 


Thr 


Val 










220 










Tyr 


Leu 


Lys 


Glu 


Lys 


His 


Gly Val 


He 








235 










240 


Asn 


He 


Leu 


Leu 


Asp 


Glu Arg Gly Gin 






250 










255 




He 


Ser 


Gly 


Arg 


Leu 


Val 


As P 


Ser 


Lys 




265 










270 




Cys 


Ala 


Ala 


Tyr 


Met 


Ala 


Pro 


Glu 


Arg 


280 










285 






Lys 


Pro 


Asp 


Tyr 


Asp 


lie 


Arg 


Ala 


Asp 










300 








Leu 


Val 


Glu 


Leu 


Ala 


Thr 


Gly 


Gin 


Phe 








315 








320 


Asp 


Phe 


Glu 


Val 


Leu 


Thr 


Lys 


Val 


Leu 






330 








335 




Pro 


Gly 


His 


Met 


Gly 


Phe 


Ser 


Gly Asp 



345 350 
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Phe 


Gin 


Ser 


Phe 


Val 


Lys 


Asp 


Cys 


Leu 


Thr 


Lys 


Asp 


His 


Arg 


Lys 


Arg 








355 










360 










365 






Pro 


Lys 


Tyr 


Asn 


Lys 


Leu 


Leu 


Glu 


His 


Ser 


Phe 


He 


He 


Lys 


His 


Tyr 






370 










375 










380 






5 


Glu 


lie 


Leu 


Glu 


Val 


Asp 


Val 


Ala 


Ser 


Trp 


Phe 


Lys 


Asp 


Val 


Met 


Ala 




385 










390 










395 






400 




Lys 


Thr 


Glu 


Ser 


Pro 
405 


Arg 


Thr 


Ser 


Gly 


Val 
410 


Leu 


Ser 


Gin 


His 


His 
415 


Leu 




Pro 


Phe 


Phe 


Ser 


Gly 


Ser 


Leu 


Glu 


Glu 


Ser 


Pro 


Thr 


Ser 


Pro 


Pro 


Ser 


10 








420 










425 










430 








Pro 


Lys 


Ser 
435 


Phe 


Pro 


Leu 


Ser 


Pro 
440 


Ala 


He 


Pro 


Gin 


Ala 
445 


Gin 


Ala 


Glu 




Trp 


Val 
450 


Ser 


Gly 


Arg 

























15 (2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

2 0 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 



Met Xaa Ser Pro Ala Pro Ala Pro 

1 5 
Pro Leu Ala Asn Asp Gly Gly Ser 
25 20 

Pro Gin His Pro Thr Pro Pro Thr 
35 40 



Ser Gin Arg Ala Ala Leu Gin Leu 

10 15 
Arg Ser Pro Ser Ser Glu Ser Ser 

25 30 
Arg Pro Arg His 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 77 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 



35 



40 



Glu 


Gly 


Gly 


Gly Val 


Lys 


His 


Met 


Ala 


Lys 


Leu Tyr Val Phe 


Tyr 


Gly 


1 








5 










10 




15 


Ala 


Gly 


Cys 


Met 


Glu 


Met 


Ser Asp 


He 


Glu 


Leu Leu Leu His 


Arg 


Asp 








20 










25 




30 


Lys 


Pro 


Asn 


Leu 


Gly 


Lys 


Cys 


Asp 


Phe 


Gly 


Ser Gly Leu Ser 


Ala 


Gly 






35 










40 






45 




Tyr 


Met 


Pro 


Glu 


Arg 


Tyr Val 


Ser Asp 


Trp 


Ser Gly Glu Ala Arg 


Pro 




50 










55 








60 






Phe 


Leu 


Val 


Pro 


Leu 


Phe 


Phe 


Cys 


Leu 


Lys 


Arg Leu His 






65 










70 










75 
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Electronic data bases consulted (Name of data base and where practicable terms used): 
APS; STN (file: Biosis, Registry) 

Search Terms: mitogen activated protein kinase kinase; SEQ ID NO: 18, 20, 26, 28, 30, and 32 

BOX II. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING 
This ISA found multiple inventions as follows: 

This application contains the following inventions or groups 
of inventions which are not so linked as to form a single 
inventive concept under PCT Rule 13.1. In order for all 
inventions to be searched, the appropriate additional search fees 
must be paid. 



Group I, claim (s) 1 and 2, draawn to mammalian mitogen- 
activated protein kinase kinase (MKK) ; and claims 10-12, drawn 
to method of measuring the activity of a mitogen-activated 
protein kinase kinase (MKK7) . 

Group II, claim (s) 3-5, drawn to isolated polynucleotide 
encoding MKK; claim 6, drawn torecombinant expression vector; and 
claim 7, drawn to a host cell. 

Group III, claim (s) 8 and 9, drawn to purified antibodies; 
and claim 13, drawn to an immuno-based method for measuring the 
synthesis of MKK 7 . 

Group IV, claim (s) 14, drawn to nucleic acid-based method for 
measuring the synthesis of MKK7 . 

Group V, claim (s) 15-17, drawn to a method for identifying 
a reagent that moduates MKK 7 activity; claims 18 and 19, drawn 
to a method for identifying a reagent that modulates MKK 7 
synthesis; and claim 20, drawn to a nucleic acid-based method for 
identifying a reagent that modulates MKK 7 expression. 

Group VI, claim (s) 21-24, drawn to a method of treating a 
MKK7 -mediated disorder. 

The inventions listed as Groups I -VI do not relate to a 
single inventive concept under PCT Rule 13.1 because, under PCT 
Rule 13.2, they lack the same or corresponding special technical 
features for the following reasons: The invention of Group I is 
drawn to an enzyme and a related emthod of use while the 
invnetion of Group II is drawn to a nucleic acid, a recombinant 
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vector, and a life form- a host cell. Further, the invention of 
Group VI is drawn to a method of treating patients that may 
suffer from ischemic heart disease, kidney failure, oxidative 
liver damage, respiratory distress syndrome, heat radiation 
burns, etc., whereinsaid method requires the use of a drug. Said 
drug, however, is not the enzyme of Group I nor the nucleic acid, 
vector, or host cell of Group II. Further, the aspect of mitogen 
activated kinase kinase contributing to such diseases is known 
in the prior art. Accordingly, the claimed invnetions are not 
so inked by a special technical feature so to form a single 
inventive concept under PCT Rule 13.1. 
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